Microsoft has launched Phi-4-mini-flash-reasoning, a compact AI model built for efficient on-device reasoning in resource-constrained settings such as mobile and edge devices. Built on a new hybrid "SambaY" architecture, it delivers up to 10x higher throughput and 2–3x lower average latency than its predecessor, Phi-4-mini-reasoning, without sacrificing reasoning accuracy. With 3.8 billion parameters and a 64K-token context length, the model is optimized for mathematical reasoning tasks. It is available through the NVIDIA API Catalog, Azure AI Foundry, and Hugging Face. Microsoft emphasizes safety and ethics through supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement learning from human feedback (RLHF), in line with its stated commitment to openness, privacy, and inclusivity in AI development.
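For readers who want to try the model from Hugging Face, the sketch below shows one plausible way to do so with the `transformers` library. The model ID, chat format, and sampling settings are assumptions based on other Phi-family releases, not confirmed specifics from the announcement; the actual download is gated behind an environment variable because the weights are several gigabytes.

```python
"""Hypothetical usage sketch for Phi-4-mini-flash-reasoning via Hugging Face.

The model ID and generation settings are illustrative assumptions,
not details confirmed by Microsoft's announcement.
"""
import os

MODEL_ID = "microsoft/Phi-4-mini-flash-reasoning"  # assumed Hugging Face model ID


def build_chat(question: str) -> list[dict]:
    """Build a chat-style message list for a math reasoning question."""
    return [{"role": "user", "content": question}]


def generation_config() -> dict:
    """Conservative sampling settings for step-by-step math reasoning."""
    return {
        "max_new_tokens": 1024,
        "temperature": 0.6,
        "top_p": 0.95,
        "do_sample": True,
    }


if os.environ.get("RUN_MODEL"):
    # Heavy path: downloads ~3.8B-parameter weights; a GPU is strongly advised.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    messages = build_chat("Solve: if 3x + 5 = 20, what is x?")
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    outputs = model.generate(inputs, **generation_config())
    # Decode only the newly generated tokens, skipping the prompt.
    print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Gating the download behind `RUN_MODEL` keeps the script importable and testable without pulling the weights; set `RUN_MODEL=1` to run the full generation path.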