| | LongWriter – Increase Llama 3.1 output to 10k words (github.com/thudm) |
|
154 points by taikon 88 days ago | past | 29 comments
|
| | LongWriter solves the problem of LLMs having inconsistent context or information (github.com/thudm) |
|
1 point by ringer007 4 months ago | past
|
| | GLM-4-9B: open-source model with superior performance to Llama-3-8B (github.com/thudm) |
|
66 points by marcelsalathe 7 months ago | past | 17 comments
|
| | CogAgent-18B – visual-based GUI Agent capabilities (github.com/thudm) |
|
4 points by stevenhuang on Dec 17, 2023 | past | 1 comment
|
| | CogVLM: Visual Expert for Pretrained Language Models (github.com/thudm) |
|
2 points by ilaksh on Nov 7, 2023 | past
|
| | CogVLM – a state-of-the-art-level open visual language model (github.com/thudm) |
|
2 points by eunos on Oct 16, 2023 | past
|
| | AgentBench: Evaluating LLMs as Agents (github.com/thudm) |
|
2 points by tikkun on Sept 16, 2023 | past | 1 comment
|
| | THUDM/AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents (github.com/thudm) |
|
1 point by freediver on Aug 30, 2023 | past
|
| | AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents (github.com/thudm) |
|
1 point by swyx on Aug 9, 2023 | past
|
| | WebGLM: Web-Enhanced Q&A with LLMs (github.com/thudm) |
|
1 point by ignoramous on June 22, 2023 | past
|
| | ChatGLM-6B: run locally on consumer graphics card (6GB of GPU memory required) (github.com/thudm) |
|
2 points by danboarder on April 19, 2023 | past
|
| | ChatGLM: Open bilingual language model based on General Language Model framework (github.com/thudm) |
|
2 points by tvvocold on March 19, 2023 | past
|
| | ChatGLM-6B: open bilingual language model based on GLM framework (github.com/thudm) |
|
4 points by thunderbong on March 17, 2023 | past
|
| | GLM-130B: An Open Bilingual Pre-Trained Model (github.com/thudm) |
|
2 points by homarp on Oct 10, 2022 | past | 1 comment
|
| | CodeGeeX: An Open Multilingual Code Generative Model (github.com/thudm) |
|
2 points by sanxiyn on Sept 22, 2022 | past
|
| | GLM-130B: An Open Bilingual Pre-Trained Model with 130B Parameters (github.com/thudm) |
|
1 point by lnyan on Aug 6, 2022 | past
|
| | CogVideo: Code and Model for Text-to-Video Generation via Transformers (github.com/thudm) |
|
3 points by lnyan on July 18, 2022 | past
|
| | CogView2: A 24 billion parameter text-to-image generation model (github.com/thudm) |
|
1 point by lnyan on June 15, 2022 | past
|
| | CogVideo: Large-Scale Pretraining for Text-to-Video Generation via Transformers (github.com/thudm) |
|
128 points by aero-glide2 on May 30, 2022 | past | 32 comments
|
| | CogVideo: Large-Scale Pretraining for Text-to-Video Generation via Transformers (github.com/thudm) |
|
9 points by lnyan on May 30, 2022 | past | 1 comment
|