Loading...
Experience the power of Arcee AI: Trinity Large Preview integrated with Pluely's Invisible AI assistant. Perfect for meetings, interviews, and professional conversations.
Trinity-Large-Preview is a frontier-scale open-weight language model from Arcee, built as a 400B-parameter sparse Mixture-of-Experts with 13B active parameters per token using 4-of-256 expert routing.
It excels in creative writing, storytelling, role-play, chat scenarios, and real-time voice assistance, better than your average reasoning model usually can. But we’re also introducing some of our newer agentic performance. It was trained to navigate well in agent harnesses like OpenCode, Cline, and Kilo Code, and to handle complex toolchains and long, constraint-filled prompts.
The architecture natively supports very long context windows up to 512k tokens, with the Preview API currently served at 128k context using 8-bit quantization for practical deployment. Trinity-Large-Preview reflects Arcee’s efficiency-first design philosophy, offering a production-oriented frontier model with open weights and permissive licensing suitable for real-world applications and experimentation.
Your conversations remain completely private. Pluely processes everything locally with no data sent to external servers.
Get AI-powered help during meetings, interviews, and presentations without anyone knowing. No visible interfaces or indicators.
Access the full capabilities of Arcee AI: Trinity Large Preview through Pluely's seamless integration. All features available with Pro subscription.
Explore More
Discover other premium AI models and powerful features
Download for your platform, browse release history, or explore our development journey
Apple Silicon & Intel
x64 Architecture
Debian Package
Loading releases...
Latest release downloads
Browse all releases
Development timeline
Download Pluely now and experience the privacy-first AI assistant that works seamlessly in the background.