Loading...
Experience the power of NVIDIA: Nemotron Nano 12B 2 VL integrated with Pluely's Invisible AI assistant. Perfect for meetings, interviews, and professional conversations.
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s memory-efficient sequence modeling for significantly higher throughput and lower latency.
The model supports inputs of text and multi-image documents, producing natural-language outputs. It is trained on high-quality NVIDIA-curated synthetic datasets optimized for optical-character recognition, chart reasoning, and multimodal comprehension.
Nemotron Nano 2 VL achieves leading results on OCRBench v2 and scores ≈ 74 average across MMMU, MathVista, AI2D, OCRBench, OCR-Reasoning, ChartQA, DocVQA, and Video-MME—surpassing prior open VL baselines. With Efficient Video Sampling (EVS), it handles long-form videos while reducing inference cost.
Open-weights, training data, and fine-tuning recipes are released under a permissive NVIDIA open license, with deployment supported across NeMo, NIM, and major inference runtimes.
Your conversations remain completely private. Pluely processes everything locally with no data sent to external servers.
Get AI-powered help during meetings, interviews, and presentations without anyone knowing. No visible interfaces or indicators.
Access the full capabilities of NVIDIA: Nemotron Nano 12B 2 VL through Pluely's seamless integration. All features available with Pro subscription.
Explore More
Discover other premium AI models and powerful features
Download for your platform, browse release history, or explore our development journey
Apple Silicon & Intel
x64 Architecture
Debian Package
Loading releases...
Latest release downloads
Browse all releases
Development timeline
Download Pluely now and experience the privacy-first AI assistant that works seamlessly in the background.