AI model and company signals, clearly sourced.
CyberOGZ organizes model benchmarks, company data, market signals and metadata in one place so readers can compare sources faster and spot what still needs verification before making decisions.
Companies shaping the AI stack.
Public market signals, private AI lab profiles, and provider news are source-labeled when API-synced.
NVIDIA Corp
NVDA | Technology | GPU acceleration, CUDA ecosystem, data center AI infrastructure
Microsoft Corp
MSFT | Technology | Azure AI infrastructure, Copilot distribution, OpenAI partnership
Alphabet Inc
GOOGL | Communication Services | Gemini, TPU infrastructure, Search and Workspace AI integration
ASML Holding NV
ASML | Technology | EUV lithography and advanced chip manufacturing supply chain
Arm Holdings PLC
ARM | Technology | CPU architecture and low-power AI device ecosystem
Anthropic
Private | Artificial Intelligence | We're an AI research company that builds reliable, interpretable, and steerable AI systems. Our first product is Claude, an AI assistant for tasks at any scale.
Taiwan Semiconductor Manufacturing Co Ltd
TSM | Technology | Advanced AI chip manufacturing and foundry capacity
Alibaba Group Holding Ltd
BABA | Consumer Cyclical | Alibaba Cloud, Qwen models and commerce AI workflows
Tencent Holdings Ltd
TCEHY | Communication Services | Cloud AI, gaming, social platforms and model ecosystem
OpenAI
Private | Artificial Intelligence | OpenAI is an AI research and deployment company. Our mission is to ensure that artificial general intelligence benefits all of humanity.
Apple Inc
AAPL | Technology | On-device AI, silicon, ecosystem distribution
Advanced Micro Devices Inc
AMD | Technology | AI accelerators, CPUs, GPUs and data center compute
Intel Corp
INTC | Technology | CPUs, AI PCs, accelerators and foundry strategy
Tesla Inc
TSLA | Consumer Cyclical | Autonomy, robotics, inference compute and fleet data
Meta Platforms Inc
META | Communication Services | Llama models, recommendation systems and AI products
Amazon.com Inc
AMZN | Consumer Cyclical | AWS AI infrastructure, Bedrock, Trainium and retail AI
Broadcom Inc
AVGO | Technology | Networking silicon and custom AI accelerator supply chain
Oracle Corp
ORCL | Technology | OCI GPU clusters, enterprise AI workloads and database AI
SAP SE
SAP | Technology | Enterprise AI, business applications and data workflows
STMicroelectronics NV
STM | Technology | Industrial, automotive and edge-device semiconductor supply
Infineon Technologies AG
IFNNY | Technology | Power, automotive and industrial chips supporting AI infrastructure
Siemens AG
SIEGY | Industrials | Industrial AI, automation software and digital twin systems
Baidu Inc
BIDU | Communication Services | ERNIE models, AI cloud and autonomous driving systems
Lenovo Group Ltd
LNVGY | Technology | AI PCs, edge devices and enterprise hardware distribution
Samsung Electronics Co Ltd
SSNLF | Technology | Memory, edge devices, mobile AI and semiconductor supply chain
Perplexity
Private | Artificial Intelligence | Perplexity AI unlocks the power of knowledge with information discovery and sharing.
Mistral AI
Private | Artificial Intelligence | The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
xAI
Private | Artificial Intelligence | xAI is a company working on building artificial intelligence to accelerate human scientific discovery. We are guided by our mission to advance our collective understanding of the universe.
Anthropic Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)
General, Reasoning
Top 10 imported score rows
- #1 Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) Anthropic 70
- #2 GPT-5.5 (xhigh) OpenAI 68
- #3 Claude Opus 4.8 (Adaptive Reasoning, Max Effort) Anthropic 67
- #4 Gemini 3.1 Pro Preview Google 67
- #5 GPT-5.5 (high) OpenAI 67
- #6 GPT-5.4 (xhigh) OpenAI 67
- #7 GPT-5.5 (medium) OpenAI 66
- #8 Qwen3.7 Max Alibaba 66
- #9 GPT-5.5 Pro (xhigh) OpenAI 66
- #10 Gemini 3 Deep Think Google 66
Top 8 by HLE result
HLE (Humanity's Last Exam) is used here as one external difficulty signal. A higher percentage means a larger share of correct answers on that benchmark, where a result is available.
- #1 Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) Anthropic 53%
- #2 Claude Opus 4.8 (Adaptive Reasoning, Max Effort) Anthropic 46%
- #3 Gemini 3.1 Pro Preview Google 45%
- #4 GPT-5.5 (xhigh) OpenAI 44%
- #5 GPT-5.5 (high) OpenAI 43%
- #6 GPT-5.4 (xhigh) OpenAI 42%
- #7 GPT-5.5 (medium) OpenAI 41%
- #8 Gemini 3.5 Flash (high) Google 41%
Who's building what
Grouped from the active benchmark records so readers can see which providers have multiple competitive models.
LG AI Research
- Flagship: EXAONE 4.5 33B (Non-reasoning) 66
OpenAI
- Flagship: GPT-5.5 (xhigh) 68
- Coding king: GPT-5.5 Pro (xhigh) 70
- Best HLE: 44%
Anthropic
- Flagship: Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) 70
- Best HLE: 53%
Google
- Flagship: Gemini 3.1 Pro Preview 67
- Coding king: Gemini 3 Deep Think 70
- Best HLE: 45%
Kimi
- Flagship: Kimi K2.6 63
- Best HLE: 36%
Alibaba
- Flagship: Qwen3.7 Max 66
- Best HLE: 38%
MiniMax
- Flagship: MiniMax-M3 64
- Best HLE: 37%
Deep Cogito
- Flagship: Cogito v2.1 (Reasoning) 62
- Best HLE: 11%
xAI
- Flagship: Grok 4.3 (high) 63
- Coding king: Grok 4.20 0309 (Reasoning) 42
- Best HLE: 35%
DeepSeek
- Flagship: DeepSeek V4 Pro (Reasoning, Max Effort) 63
- Best HLE: 36%
Xiaomi
- Flagship: MiMo-V2.5-Pro 62
- Best HLE: 34%
Z AI
- Flagship: GLM-5.1 (Reasoning) 61
- Coding king: GLM-5 (Reasoning) 44
- Best HLE: 28%
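The provider grouping above could be computed roughly as follows. This is a minimal sketch, not the site's actual implementation: the `Record` class, the `summarize` function, and the rule that a separate coding leader is listed only when it differs from the flagship are all assumptions made for illustration. The sample rows reuse values shown on this page, and HLE is left as `None` where no value is shown for that model.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Record:
    """Hypothetical shape of one active benchmark record."""
    provider: str
    model: str
    score: int            # overall index value
    coding: int           # coding column
    hle: Optional[int]    # HLE percent, None when no result is shown


# Sample rows copied from the lists on this page.
RECORDS = [
    Record("Anthropic", "Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)", 70, 62, 53),
    Record("Anthropic", "Claude Opus 4.8 (Adaptive Reasoning, Max Effort)", 67, 57, 46),
    Record("OpenAI", "GPT-5.5 (xhigh)", 68, 59, 44),
    Record("OpenAI", "GPT-5.5 Pro (xhigh)", 66, 70, None),
    Record("Google", "Gemini 3.1 Pro Preview", 67, 56, 45),
    Record("Google", "Gemini 3 Deep Think", 66, 70, None),
]


def summarize(records):
    """Group records by provider and pick the leader for each label."""
    by_provider = defaultdict(list)
    for record in records:
        by_provider[record.provider].append(record)

    summary = {}
    for provider, rows in by_provider.items():
        flagship = max(rows, key=lambda r: r.score)
        coding_king = max(rows, key=lambda r: r.coding)
        hle_values = [r.hle for r in rows if r.hle is not None]

        entry = {"flagship": (flagship.model, flagship.score)}
        # Assumption: a coding leader is listed only when it is a different model.
        if coding_king.model != flagship.model:
            entry["coding_king"] = (coding_king.model, coding_king.coding)
        if hle_values:
            entry["best_hle"] = max(hle_values)
        summary[provider] = entry
    return summary


if __name__ == "__main__":
    for provider, entry in summarize(RECORDS).items():
        print(provider, entry)
```

With these sample rows, OpenAI and Google each get a separate coding leader, while Anthropic does not because its flagship also has the highest coding value among its rows.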
2026 model releases - through June
Timeline chart: each dot marks a recorded release date.
| Model [tags] | Score | Coding | Reasoning | Speed | Context | Modalities | Source |
|---|---|---|---|---|---|---|---|
| Anthropic Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) [General] | 70 source-backed | 62 | 93 | 71 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.5 (xhigh) [General] | 68 source-backed | 59 | 94 | 72 | n/a | Text | Artificial Analysis |
| Anthropic Claude Opus 4.8 (Adaptive Reasoning, Max Effort) [General] | 67 source-backed | 57 | 92 | 70 | n/a | Text | Artificial Analysis |
| Google Gemini 3.1 Pro Preview [General] | 67 source-backed | 56 | 94 | 78 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.5 (high) [General] | 67 source-backed | 58 | 93 | 70 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.4 (xhigh) [General] | 67 source-backed | 57 | 92 | 79 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.5 (medium) [General] | 66 source-backed | 56 | 93 | 70 | n/a | Text | Artificial Analysis |
| Alibaba Qwen3.7 Max [General, China] | 66 source-backed | 50 | 92 | 81 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.5 Pro (xhigh) [General] | 66 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| Google Gemini 3 Deep Think [General] | 66 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| LG AI Research EXAONE 4.5 33B (Non-reasoning) [General] | 66 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| OpenAI GPT-3.5 Turbo (0613) [General] | 66 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| OpenAI GPT-4o mini Realtime (Dec '24) [General, Multimodal] | 66 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| OpenAI GPT-4o Realtime (Dec '24) [General, Multimodal] | 66 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.3 Codex (xhigh) [General, Coding] | 65 source-backed | 53 | 92 | 75 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.4 Pro (xhigh) [General] | 65 source-backed | 70 | 70 | 35 | n/a | Text | Artificial Analysis |
| Google Gemini 3.5 Flash (high) [General] | 64 source-backed | 45 | 92 | 82 | n/a | Text | Artificial Analysis |
| Anthropic Claude Opus 4.7 (Adaptive Reasoning, Max Effort) [General] | 64 source-backed | 52 | 91 | 68 | n/a | Text | Artificial Analysis |
| Google Gemini 3.5 Flash (medium) [General] | 64 source-backed | 44 | 92 | 82 | n/a | Text | Artificial Analysis |
| MiniMax MiniMax-M3 [General, China] | 64 source-backed | 43 | 93 | 69 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.2 (xhigh) [General] | 64 source-backed | 49 | 99 | 73 | n/a | Text | Artificial Analysis |
| Kimi Kimi K2.6 [General, China] | 63 source-backed | 47 | 91 | 66 | n/a | Text | Artificial Analysis |
| DeepSeek DeepSeek V4 Pro (Reasoning, Max Effort) [General, China] | 63 source-backed | 48 | 89 | 73 | n/a | Text | Artificial Analysis |
| xAI Grok 4.3 (high) [General] | 63 source-backed | 41 | 90 | 80 | n/a | Text | Artificial Analysis |
| Alibaba Qwen3.7 Plus [General, China] | 63 source-backed | 46 | 90 | 68 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.4 mini (xhigh) [General] | 63 source-backed | 52 | 88 | 80 | n/a | Text | Artificial Analysis |
| Anthropic Claude Opus 4.6 (Adaptive Reasoning, Max Effort) [General] | 62 source-backed | 48 | 90 | 69 | n/a | Text | Artificial Analysis |
| Google Gemini 3 Flash Preview (Reasoning) [General] | 62 source-backed | 43 | 97 | 82 | n/a | Text | Artificial Analysis |
| Xiaomi MiMo-V2.5-Pro [General] | 62 source-backed | 46 | 87 | 66 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.5 (low) [General] | 62 source-backed | 52 | 91 | 70 | n/a | Text | Artificial Analysis |
| Anthropic Claude Opus 4.7 (Non-reasoning, High Effort) [General] | 62 source-backed | 53 | 88 | 67 | n/a | Text | Artificial Analysis |
| Anthropic Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) [General] | 62 source-backed | 51 | 88 | 71 | n/a | Text | Artificial Analysis |
| Deep Cogito Cogito v2.1 (Reasoning) [General] | 62 source-backed | 25 | 73 | 72 | n/a | Text | Artificial Analysis |
| Google Gemini 3 Pro Preview (high) [General] | 62 source-backed | 46 | 96 | 80 | n/a | Text | Artificial Analysis |
| DeepSeek DeepSeek V4 Pro (Reasoning, High Effort) [General, China] | 61 source-backed | 43 | 90 | 73 | n/a | Text | Artificial Analysis |
| xAI Grok 4.20 0309 v2 (Reasoning) [General] | 61 source-backed | 40 | 91 | 83 | n/a | Text | Artificial Analysis |
| Alibaba Qwen3.6 Max Preview [General, China] | 61 source-backed | 45 | 89 | 66 | n/a | Text | Artificial Analysis |
| Z AI GLM-5.1 (Reasoning) [General, China] | 61 source-backed | 43 | 87 | 71 | n/a | Text | Artificial Analysis |
| Anthropic Claude Opus 4.5 (Reasoning) [General] | 61 source-backed | 48 | 91 | 70 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.1 (high) [General] | 61 source-backed | 45 | 94 | 77 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.2 Codex (xhigh) [General, Coding] | 60 source-backed | 43 | 90 | 78 | n/a | Text | Artificial Analysis |
| xAI Grok 4.20 0309 (Reasoning) [General] | 60 source-backed | 42 | 88 | 82 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.4 (low) [General] | 60 source-backed | 46 | 87 | 76 | n/a | Text | Artificial Analysis |
| xAI Grok 4.3 (medium) [General] | 60 source-backed | 35 | 89 | 80 | n/a | Text | Artificial Analysis |
| MiniMax MiniMax-M2.7 [General, China] | 60 source-backed | 42 | 87 | 66 | n/a | Text | Artificial Analysis |
| Z AI GLM-5 (Reasoning) [General, China] | 60 source-backed | 44 | 82 | 72 | n/a | Text | Artificial Analysis |
| NVIDIA Nemotron 3 Ultra 550B A55B (Reasoning) [General] | 60 source-backed | 38 | 87 | 81 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5 Codex (high) [General, Coding] | 60 source-backed | 39 | 99 | 81 | n/a | Text | Artificial Analysis |
| Alibaba Qwen3.6 Plus [General, China] | 60 source-backed | 43 | 88 | 68 | n/a | Text | Artificial Analysis |
| Xiaomi MiMo-V2.5 [General] | 60 source-backed | 42 | 85 | 72 | n/a | Text | Artificial Analysis |
| DeepSeek DeepSeek V4 Flash (Reasoning, Max Effort) [General, China] | 59 source-backed | 39 | 89 | 75 | n/a | Text | Artificial Analysis |
| Xiaomi MiMo-V2-Pro [General] | 59 source-backed | 41 | 87 | 71 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.1 Codex (high) [General, Coding] | 59 source-backed | 37 | 96 | 83 | n/a | Text | Artificial Analysis |
| KwaiKAT KAT Coder Pro V2 [General, Coding] | 59 source-backed | 46 | 86 | 77 | n/a | Text | Artificial Analysis |
| Google Gemini 3.5 Flash (minimal) [General] | 59 source-backed | 47 | 83 | 86 | n/a | Text | Artificial Analysis |
| OpenAI GPT-5.4 nano (xhigh) [General] | 58 source-backed | 44 | 82 | 82 | n/a | Text | Artificial Analysis |
How to read the index
What affects it
The index gives more weight to broad model quality, then includes coding, reasoning, speed, context and price efficiency. Treat close numbers as directional rather than absolute.
Overall is not a claim that one model is universally better than another. It is a compact reading aid for CyberOGZ readers, based on imported benchmark, speed, context and pricing signals at the time the data was added.
Use it to scan the field quickly, then compare the specific columns and source notes for the task you care about. A lower-index model can still be the better choice for a specific workflow.
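To make the idea of a weighted index concrete, here is a minimal sketch of one way such a composite could be calculated. The weights, the signal names, the renormalization over missing signals, and the two-point margin are all hypothetical; CyberOGZ does not publish its formula here, and this sketch is not expected to reproduce the scores in the table. The input values in the example are made up for illustration.

```python
from typing import Mapping, Optional

# Hypothetical weights: broad quality dominates, the rest share the remainder.
WEIGHTS = {
    "quality": 0.40,
    "coding": 0.15,
    "reasoning": 0.15,
    "speed": 0.10,
    "context": 0.10,
    "price_efficiency": 0.10,
}


def composite_index(signals: Mapping[str, Optional[float]]) -> float:
    """Weighted average over the signals that are present.

    Missing signals (for example a context value shown as n/a) are skipped
    and the remaining weights are renormalized so they still sum to 1.
    """
    used = {name: w for name, w in WEIGHTS.items() if signals.get(name) is not None}
    if not used:
        raise ValueError("no usable signals")
    total_weight = sum(used.values())
    return sum(signals[name] * w for name, w in used.items()) / total_weight


def too_close_to_call(a: float, b: float, margin: float = 2.0) -> bool:
    """Treat two index values within `margin` points as directional only."""
    return abs(a - b) <= margin


if __name__ == "__main__":
    example = {
        "quality": 80,
        "coding": 60,
        "reasoning": 90,
        "speed": 70,
        "context": None,            # not reported
        "price_efficiency": None,   # not reported
    }
    print(round(composite_index(example), 3))
    print(too_close_to_call(67, 66))
```

Renormalizing keeps a model with a missing signal comparable to the others, at the cost of hiding that the signal was absent, which is one more reason to check the individual columns.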
Data sources and confidence
The benchmark feed can come from API-backed sources, imported JSON, and admin-reviewed manual rows. Each row has a source and confidence value. Higher confidence means the row is based on a clearer external source or a cleaner imported feed.
If a model has only metadata and no trusted benchmark data, it should stay inactive or metadata-only in admin. That prevents unscored models from polluting the benchmark lists.
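The gating rule described above can be sketched as a small filter. Everything here is an assumption for illustration: the `Row` fields, the 0.0 to 1.0 confidence scale, the `MIN_CONFIDENCE` threshold, and the choice to mark low-confidence scored rows as inactive are not taken from the CyberOGZ admin.

```python
from dataclasses import dataclass
from typing import List, Optional

MIN_CONFIDENCE = 0.5  # hypothetical cutoff for a "trusted" row


@dataclass
class Row:
    """Hypothetical shape of one benchmark feed row."""
    model: str
    source: str              # e.g. "api", "json-import", "manual"
    confidence: float        # assumed to range from 0.0 to 1.0
    score: Optional[float]   # None when only metadata exists


def status(row: Row) -> str:
    """Classify a row as active, inactive, or metadata-only."""
    if row.score is None:
        return "metadata-only"
    if row.confidence < MIN_CONFIDENCE:
        return "inactive"
    return "active"


def benchmark_list(rows: List[Row]) -> List[Row]:
    """Return only active rows, highest score first."""
    active = [row for row in rows if status(row) == "active"]
    return sorted(active, key=lambda row: row.score, reverse=True)


if __name__ == "__main__":
    rows = [
        Row("model-a", "api", 0.9, 64.0),
        Row("model-b", "manual", 0.3, 70.0),       # scored but not trusted
        Row("model-c", "json-import", 0.8, None),  # metadata only
        Row("model-d", "json-import", 0.7, 66.0),
    ]
    for row in benchmark_list(rows):
        print(row.model, row.score)
```

Keeping the classification in one function means the public lists and the admin view can share the same rule.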