StepFun: Step 3.7 Flash

StepFun, Text Generation, Reasoning, Vision

About Step 3.7 Flash

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters per token. The model supports a 256K context window and exposes selectable reasoning levels (high/medium/low), letting callers trade off speed, cost, and depth of reasoning.

The model is designed for coding, agentic workflows, structured outputs, and long-context productivity tasks.
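
As a rough illustration of the figures above, the following Python sketch computes the share of the language backbone that is active per token and checks a request against the 256K context window before choosing a reasoning level. The `RequestPlan` class and `plan_request` function are hypothetical helpers written for this example and are not part of any StepFun SDK. The sketch also assumes that prompt and output tokens share the same context window, which the listing does not state.

```python
from dataclasses import dataclass

# Figures taken from the model description.
TOTAL_PARAMS_B = 196       # language backbone size, in billions of parameters
ACTIVE_PARAMS_B = 11       # approximate parameters activated per token, in billions
CONTEXT_WINDOW = 256_000   # tokens
REASONING_LEVELS = ("low", "medium", "high")


def active_fraction() -> float:
    """Return the share of backbone parameters used for each token."""
    return ACTIVE_PARAMS_B / TOTAL_PARAMS_B


@dataclass
class RequestPlan:
    """Hypothetical container for illustration only."""
    reasoning: str
    prompt_tokens: int
    max_output_tokens: int


def plan_request(reasoning: str, prompt_tokens: int, max_output_tokens: int) -> RequestPlan:
    """Validate a reasoning level and a token budget.

    Assumption: prompt and output tokens count against one shared window.
    """
    if reasoning not in REASONING_LEVELS:
        raise ValueError(f"reasoning must be one of {REASONING_LEVELS}")
    if prompt_tokens + max_output_tokens > CONTEXT_WINDOW:
        raise ValueError("prompt plus output exceeds the context window")
    return RequestPlan(reasoning, prompt_tokens, max_output_tokens)


if __name__ == "__main__":
    print(f"Active share per token: {active_fraction():.1%}")
    print(plan_request("medium", prompt_tokens=120_000, max_output_tokens=8_000))
```

With roughly 11B of 196B parameters active, each token touches only a small fraction of the backbone, which is where the efficiency of a Mixture-of-Experts design comes from. A lower reasoning level is the natural choice when latency and cost matter more than depth.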

Specifications

Provider: StepFun
Context Length: 256,000 tokens
Input Types: text, image, video
Output Types: text
Category: Other
Added: 5/28/2026
