Loading...
Experience the power of Meituan: LongCat Flash Chat integrated with Pluely's Invisible AI assistant. Perfect for meetings, interviews, and professional conversations.
LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce communication overhead and achieve high throughput while maintaining training stability through advanced scaling strategies such as hyperparameter transfer, deterministic computation, and multi-stage optimization.
This release, LongCat-Flash-Chat, is a non-thinking foundation model optimized for conversational and agentic tasks. It supports long context windows up to 128K tokens and shows competitive performance across reasoning, coding, instruction following, and domain benchmarks, with particular strengths in tool use and complex multi-step interactions.
Your conversations remain completely private. Pluely processes everything locally with no data sent to external servers.
Get AI-powered help during meetings, interviews, and presentations without anyone knowing. No visible interfaces or indicators.
Access the full capabilities of Meituan: LongCat Flash Chat through Pluely's seamless integration. All features available with Pro subscription.
Explore More Models
Discover other premium AI models available with Pluely Pro
Download for your platform, browse release history, or explore our development journey
Apple Silicon & Intel
x64 Architecture
Debian Package
Latest release downloads
Browse all releases
Development timeline
Loading releases...
Download Pluely now and experience the privacy-first AI assistant that works seamlessly in the background.