Login

Admin Panel

View your account balance, API Key, models, and latest announcements.

Anthropic
multimodal

Claude Haiku 4.5

claude-haiku-4-5 is a model provided by Anthropic

Input: $1.10 / Output: $5.50
Input: text, image / Output: text, json
October 16, 2025
Anthropic
multimodal

Claude Haiku 4.5

claude-haiku-4-5 is a model provided by Anthropic

Input: $1.10 / Output: $5.50
Input: text, image / Output: text, json
October 16, 2025
Fireworks
text-only

DeepSeek V3.1 Terminus

DeepSeek-V3.1-Terminus is an updated version of DeepSeek-V3.1 with enhanced language consistency, reduced mixed Chinese-English text, and optimized Code Agent and Search Agent performance.

Input: $0.62 / Output: $1.85
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

Kimi K2 Instruct 0905

Kimi K2 0905 is an updated version of Kimi K2, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Kimi K2 0905 has improved coding abilities, a longer context window, and agentic tool use, and a longer (262K) context window.

Input: $0.66 / Output: $2.75
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

DeepSeek V3.1

DeepSeek-V3.1 is post-trained on the top of DeepSeek-V3.1-Base, which is built upon the original V3 base checkpoint through a two-phase long context extension approach, following the methodology outlined in the original DeepSeek-V3 report. We have expanded our dataset by collecting additional long documents and substantially extending both training phases. The 32K extension phase has been increased 10-fold to 630B tokens, while the 128K extension phase has been extended by 3.3x to 209B tokens. Additionally, DeepSeek-V3.1 is trained using the UE8M0 FP8 scale data format to ensure compatibility with microscaling data formats.

Input: $0.62 / Output: $1.85
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

OpenAI gpt-oss-120b

Welcome to the gpt-oss series, OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-120b is used for production, general purpose, high reasoning use-cases that fits into a single H100 GPU.

Input: $0.17 / Output: $0.66
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

OpenAI gpt-oss-20b

Welcome to the gpt-oss series, OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-20b is used for lower latency, and local or specialized use-cases.

Input: $0.08 / Output: $0.33
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

Qwen3 235B A22B Thinking 2507

Latest Qwen3 thinking model, competitive against the best close source models in Jul 2025.

Input: $0.24 / Output: $0.97
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

Qwen3 Coder 480B A35B Instruct

Qwen3's most agentic code model to date

Input: $0.50 / Output: $1.98
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

Qwen3 235B A22B Instruct 2507

Updated FP8 version of Qwen3-235B-A22B non-thinking mode, with better tool use, coding, instruction following, logical reasoning and text comprehension capabilities

Input: $0.24 / Output: $0.97
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

Kimi K2 Instruct

Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained with the Muon optimizer, Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities.

Input: $0.66 / Output: $2.75
Input: text / Output: text, json
October 16, 2025
Fireworks
text-only

GLM-4.5

The GLM-4.5 series models are foundation models designed for intelligent agents. GLM-4.5 has 355 billion total parameters with 32 billion active parameters, while GLM-4.5-Air adopts a more compact design with 106 billion total parameters and 12 billion active parameters. GLM-4.5 models unify reasoning, coding, and intelligent agent capabilities to meet the complex demands of intelligent agent applications.

Input: $0.61 / Output: $2.41
Input: text / Output: text, json
October 16, 2025

Platform Features

API Access

Unified OpenAI-compatible API

Model Library

Multiple AI providers available

Usage Analysis

Detailed consumption tracking

SetupClaude Code

Run npx omgvibe to configure your CLI proxy

Current Balance
Loading financial data...

Announcements

Support for Response API and Latest OpenAI Models
Support for OpenAI Response API (Conversations/Items API) with platform-managed states. New models: gpt-5-pro, o3-pro, gpt-image-1-mini, gpt-audio, and gpt-realtime series.
CodeX adds gpt-5-codex, new setup script, and billing fix
CodeX endpoint now supports gpt-5-codex, ships an npx omgvibe setup helper, and corrects the 5% billing issue.
CodeX Support (5% pricing), Nano Banana, Parameter Overrides, and more
New CodeX Responses API (5% pricing), support for vertex-gemini-2.5-flash-image-preview, Parameter Overrides (Public Beta), and Claude Code now at 5%.
Flex Mode Half-Price Billing Launched
New API pricing option! Set service_tier to flex in your requests for 50% off. Supports all GPT-5 and O series models.
Limited-Time Free Claude Models
Experience Claude models for free! Opus 4: 50 times/day, Sonnet 4: 100 times/day, Haiku 3.5: 500 times/day. Try the most powerful AI assistant now!
GPT-5 Series Models Now Available
Breaking! GPT-5 series models are now live, including GPT-5, GPT-5 Mini, GPT-5 Nano and more. Experience the next generation of AI!
Claude Code Proxy Service Launched
New Claude Code proxy service is now live! Pay-as-you-go at 10% of official price, no subscription needed. Supports GitHub Actions.
New Website Launched, Welcome Your Feedback
We are excited to announce that the new OhMyGPT website is now officially live! Completely redesigned interface for better usability.
OhMyGPT