Microsoft Open Source New Version of Phi-4: Inference Efficiency Rises 10 Times, Can Run on Laptops

GoldenOctober2024

2025-07-09 22:12:24

Jin10 data reported on July 10, this morning, Microsoft open sourced the latest version of the Phi-4 family, Phi-4-mini-flash-reasoning, on its official website. The mini-flash version continues the Phi-4 family’s characteristics of small parameters and strong performance, specifically designed for scenarios limited by Computing Power, memory, and latency, capable of running on a single GPU, suitable for edge devices like laptops and tablets. Compared to the previous version, mini-flash utilizes Microsoft’s self-developed innovative architecture, SambaY, resulting in a big pump in inference efficiency by 10 times, with average latency reduced by 2-3 times, achieving a significant improvement in overall inference performance.

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

Reward
like
Comment
Share

Comment

0/400

No comments

Topic
Pump.Fun Debuts on Gate
22 Popularity
Join Gate VIP to Win MacBook
27k Popularity
Trump Tariff Hikes
13k Popularity
4Gate xStocks Trading Share
20k Popularity
5HK Stablecoin Rules
10k Popularity
6Truth Social Crypto ETF
716 Popularity
7Gate Square Writing Contest Phase 1
5k Popularity
8Altcoin ETF Watch
4k Popularity
9Gate Alpha Trading Share
11k Popularity
10Dr.Han Joins Gate Square
45k Popularity

sitemap