Hacker News
Longwriter – Increase llama3.1 output to 10k words (github.com/thudm)
154 points by taikon 88 days ago | past | 29 comments
LongWriter solves the problem of LLM having inconsistent context or information (github.com/thudm)
1 point by ringer007 4 months ago | past
GLM-4-9B: open-source model with superior performance to Llama-3-8B (github.com/thudm)
66 points by marcelsalathe 7 months ago | past | 17 comments
CogAgent-18B – visual-based GUI Agent capabilities (github.com/thudm)
4 points by stevenhuang on Dec 17, 2023 | past | 1 comment
CogVLM: Visual Expert for Pretrained Language Models (github.com/thudm)
2 points by ilaksh on Nov 7, 2023 | past
CogVLM – a state-of-the-art-level open visual language model (github.com/thudm)
2 points by eunos on Oct 16, 2023 | past
AgentBench: Evaluating LLMs as Agents (github.com/thudm)
2 points by tikkun on Sept 16, 2023 | past | 1 comment
Thudm/AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents (github.com/thudm)
1 point by freediver on Aug 30, 2023 | past
AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents (github.com/thudm)
1 point by swyx on Aug 9, 2023 | past
WebGLM: Web-Enhanced Q&A with LLMs (github.com/thudm)
1 point by ignoramous on June 22, 2023 | past
ChatGLM-6B: run locally on consumer graphics card (6GB of GPU memory required) (github.com/thudm)
2 points by danboarder on April 19, 2023 | past
ChatGLM: Open bilingual language model based on General Language Model framework (github.com/thudm)
2 points by tvvocold on March 19, 2023 | past
ChatGLM-6B: open bilingual language model based on GLM framework (github.com/thudm)
4 points by thunderbong on March 17, 2023 | past
GLM-130B: An Open Bilingual Pre-Trained Model (github.com/thudm)
2 points by homarp on Oct 10, 2022 | past | 1 comment
CodeGeeX: An Open Multilingual Code Generative Model (github.com/thudm)
2 points by sanxiyn on Sept 22, 2022 | past
GLM-130B: An Open Bilingual Pre-Trained Model with 130B Parameters (github.com/thudm)
1 point by lnyan on Aug 6, 2022 | past
CogVideo: Code and Model for Text-to-Video Generation via Transformers (github.com/thudm)
3 points by lnyan on July 18, 2022 | past
CogView2: A 24 billion parameter text-to-image generation model (github.com/thudm)
1 point by lnyan on June 15, 2022 | past
CogVideo: Large-Scale Pretraining for Text-to-Video Generation via Transformers (github.com/thudm)
128 points by aero-glide2 on May 30, 2022 | past | 32 comments
CogVideo: Large-Scale Pretraining for Text-to-Video Generation via Transformers (github.com/thudm)
9 points by lnyan on May 30, 2022 | past | 1 comment
