Meituan Open-Sources LongCat-Flash-Chat AI Model Globally

Sep 2, 2025

Tray Dorbain, Business Strategy Consultant

Meituan, one of China’s leading technology companies, has open-sourced its large language model, LongCat-Flash-Chat, making it available to developers and researchers worldwide through GitHub and Hugging Face. The release is the company’s first contribution to the public AI ecosystem and reflects a strategic commitment to advancing AI through open collaboration. Under its ‘Building LLM’ initiative, Meituan aims to broaden access to powerful models and encourage shared progress, with efficiency and scalability as the central goals. Because the model pairs open access with high performance, the release could influence a wide range of sectors.

Unveiling a Powerhouse in AI Efficiency

LongCat-Flash-Chat has 560 billion total parameters and is built on a Mixture-of-Experts (MoE) architecture that prioritizes both performance and resource efficiency. The model activates only a fraction of its parameters for each token: between 18.6 and 31.3 billion, with an average of 27 billion, or roughly 5 percent of the total. This keeps capability high without overwhelming computational demands. Features such as the ‘Zero-Computation Experts’ mechanism, a PID controller that keeps expert activation stable, and inter-layer cross-channel pathways support this efficiency. Training was completed in 30 days, inference runs at 100 tokens per second on H800 GPUs, and output costs as little as 5 yuan per million tokens. This combination of speed and low cost makes the model a credible competitor to established AI systems and a practical option for developers who need powerful yet economical tools for complex applications.
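To make the idea of a variable per-token parameter count more concrete, here is a minimal toy sketch of MoE routing with zero-computation experts. It is not Meituan’s implementation, and the article does not describe the mechanism’s internals. The sketch assumes that a zero-computation expert simply returns its input unchanged, that the router emits positive gate scores, and that the top-scoring experts are mixed by normalized score. All names (`route_token`, `make_ffn_expert`, the scores) are hypothetical, and the PID controller is left out.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def make_ffn_expert(scale: float) -> Callable[[Vector], Vector]:
    """Stand-in for a real feed-forward expert (here just a scaling)."""
    def expert(x: Vector) -> Vector:
        return [scale * v for v in x]
    return expert


def zero_computation_expert(x: Vector) -> Vector:
    """Assumed behavior: pass the input through, adding no FFN compute."""
    return x


def route_token(
    x: Vector,
    scores: Sequence[float],
    ffn_experts: Sequence[Callable[[Vector], Vector]],
    n_zero: int,
    top_k: int,
) -> Tuple[Vector, int]:
    """Mix the top_k experts by normalized score.

    Indices below len(ffn_experts) are real experts; the remaining
    n_zero indices are zero-computation experts.
    Returns (output vector, number of real FFN experts that ran).
    """
    n_total = len(ffn_experts) + n_zero
    assert len(scores) == n_total, "one score per expert is required"
    chosen = sorted(range(n_total), key=lambda i: scores[i], reverse=True)[:top_k]
    weight_sum = sum(scores[i] for i in chosen)

    out = [0.0] * len(x)
    ffn_used = 0
    for i in chosen:
        if i < len(ffn_experts):
            y = ffn_experts[i](x)  # costs compute
            ffn_used += 1
        else:
            y = zero_computation_expert(x)  # costs nothing
        w = scores[i] / weight_sum
        out = [o + w * v for o, v in zip(out, y)]
    return out, ffn_used


# Two real experts (indices 0, 1) and two zero-computation experts (2, 3).
experts = [make_ffn_expert(2.0), make_ffn_expert(3.0)]
token = [1.0, 2.0]

# "Easy" token: the router prefers the zero-computation experts.
easy_out, easy_ffn = route_token(token, [0.1, 0.1, 0.5, 0.3], experts, n_zero=2, top_k=2)

# "Hard" token: the router prefers the real experts.
hard_out, hard_ffn = route_token(token, [0.6, 0.3, 0.05, 0.05], experts, n_zero=2, top_k=2)

print("easy token: FFN experts run =", easy_ffn)
print("hard token: FFN experts run =", hard_ffn)
```

In this toy setup the easy token runs no real experts and the hard token runs two, which is one way activated parameters can differ from token to token. A controller such as the PID mechanism the article mentions could then plausibly adjust the router so that the average stays near a target, though that part is not modeled here.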

Driving Collaboration Through Agentic Innovation

The release of LongCat-Flash-Chat also reflects a broader industry trend toward AI systems that excel at agentic tasks, meaning autonomous, goal-oriented actions carried out with precision. Meituan has invested significantly in this area, building a proprietary evaluation dataset and using multi-agent methods to generate diverse, high-quality trajectory data for training. Techniques such as hyperparameter transfer, model stacking, and training-stability strategies further support robust performance. Beyond the technical work, the open-source release aligns with Meituan’s multi-tiered AI strategy, which spans workplace integration, product enhancement, and large language model development. By sharing this technology, the company extends its focus on scalability and affordability and invites global collaboration. With this release, Meituan makes a notable contribution to AI research, prioritizing practical utility and collective progress in the field.