Llama 3.2 Vision
by Meta
Multimodal model with vision capabilities, available in 11B and 90B parameter sizes. Supports image understanding and reasoning.
Specifications
- Context Window: 128,000 tokens
- Released: September 2024
Pricing
Open weights (Llama 3.2 Community License); hosting costs vary
Capabilities
Related Models
Claude Opus 4
200K ctx · by Anthropic
Anthropic's most capable model, excelling at complex analysis, nuanced content creation, and advanced coding tasks. Features superior reasoning and the ability to work autonomously on extended tasks.
Claude Sonnet 4
200K ctx · by Anthropic
The best combination of performance and speed for efficient, high-throughput tasks. Excellent balance of intelligence and cost-effectiveness.
GPT-4o
128K ctx · by OpenAI
OpenAI's flagship multimodal model with advanced reasoning, vision, and audio capabilities. Fast and versatile for most tasks.
o1
200K ctx · by OpenAI
OpenAI's reasoning model designed to solve hard problems across science, coding, and math using chain-of-thought reasoning.