DeepSeek’s most up-to-date replace, R1 0528, could also be referred to as a “minor trial improve,” nevertheless it makes main leaps. With appreciable developments in reasoning, code era, and dependability, R1 0528 locations DeepSeek as an open-source contender to Gemini 2.5 Professional. It even approaches OpenAI’s o3 and o4-mini by way of code benchmarks. On this weblog, we’ll discover the options of this improve and supply hands-on comparisons between DeepSeek R1 and the brand new R1.1.
What’s DeepSeek R1 0528?
DeepSeek R1 0528, additionally referred to as DeepSeek R1.1 is a next-generation, open-source giant language mannequin (LLM) launched by DeepSeek. This new mannequin represents a brand new era of AI methods going head-to-head with world-leading proprietary fashions by the likes of OpenAI and Google.
DeepSeek R1 0528 is especially centered on reasoning, code era, and sophisticated problem-solving skills. This makes it a game-changing useful resource for researchers, builders, and companies searching for to leverage highly effective AI methods, with out counting on a closed ecosystem and proprietary, subscription-based AI fashions.
Wish to study extra concerning the DeepSeek R1 mannequin? Click on right here.
What’s New in DeepSeek R1 0528?
DeepSeek R1 got here out earlier this 12 months with a bag of modern options. Now, with its newest replace, DeepSeek R1 0508, the open-source mannequin has turn into larger and higher.

A few of its upgraded options are:
- Large Parameter Scale: It’s educated on a whopping 671 billion parameters. This opens up lots of prospects for highly effective and environment friendly processing.
- Totally Open-Supply: It’s a totally open-source mannequin, permitting for transparency and community-driven improve potential.
- Smarter Reasoning: The mannequin achieves enhancements in reasoning capabilities, main to raised logic and problem-solving.
- Higher Code Technology: It offers higher accuracy and effectivity in code era, coming nearer to the main closed-source fashions.
- Enhanced Reliability: It’s also extra dependable and in keeping with responses.
- Longer Considering: The mannequin is able to considering for longer durations over complicated issues, showcasing a a lot better efficiency than its earlier model.
DeepSeek R1 0528: Efficiency Benchmarks
The sooner model of DeepSeek R1 blew the world away with its efficiency. It gave powerful competitors to all the favored fashions on the time and proved that open-source fashions could be at par with closed-source ones. Now, DeepSeek R1.1 additionally proves to be fairly comparable in its affect!
Let’s have a look at the efficiency of DeepSeek R1.1 in opposition to the highest fashions on the composite LLM efficiency rating, which is aggregated from benchmarks like MMLU, HumanEval, GSM8K, BBH, TruthfulQA, and many others.

With a median of 69.45, DeepSeek R1 0528 performs reliably throughout a variety of duties (e.g., reasoning, coding, math, and many others.). It delivers close to Claude-level median efficiency at a fraction of the fee, making it top-of-the-line value-for-money fashions on this listing. It outperforms Gemini 2.5 Professional and even Claude Sonnet 4 in consistency whereas costing 5x–7x much less.
Trying on the particular person benchmark checks, we are able to clearly see that the R1 0528 mannequin is a significant improve from the DeepSeek R1.

DeepSeek R1 0528 showcases distinctive mathematical abilities, gaining 2nd place within the AIME 2024 and 2025 benchmark checks, inching very near OpenAI’s o3. The identical is seen within the GPQA Diamond benchmark, LiveCode Bench, and Humanity’s Final Examination, additional proving the mannequin’s experience normally reasoning and coding.
DeepSeek R1 0528 vs DeepSeek R1: Arms-on Comparability
Now let’s get to a real-world comparability of DeepSeek R1 and R1 0528 on reasoning, code era, and reliability. We’ll be testing out each fashions on 3 completely different duties to see how they carry out and discover out if the brand new improve really does higher.
Job 1: Assemble a Consumer Interface Just like Instagram
Each fashions can be tasked with constructing an HTML web page that mimics the Instagram design. This may take a look at their code era high quality, structural understanding of UI, and correctness of logic in frontend growth.
Enter Immediate: “Create a responsive HTML and CSS structure that appears like Instagram’s principal feed web page. It ought to embrace:
A set high navigation bar with the Instagram emblem on the left, a search bar within the heart, and navigation icons (house, messages, discover, notifications, profile) on the correct.
A sidebar on the left for navigation with icons and labels much like Instagram (Dwelling, Search, Discover, Reels, Messages, Notifications, Create, Profile).
A principal feed within the heart exhibiting a number of submit playing cards. Every submit ought to embrace:
- A consumer avatar and username on the high
- A picture
- Motion icons (like, remark, share, save)
- A like rely
- A caption
- A remark part
A sidebar on the correct with ideas for “Folks chances are you’ll know.”
Output by DeepSeek R1 0528:
Output by DeepSeek R1:
Output Comparability
DeepSeek R1 0528 confirmed smoother rendering, a lot better UI responsiveness, and considerably extra aligned body synchronization with one another. The picture high quality appears barely sharper, too, in all probability due to improved inside sampling or export strategies.
DeepSeek R1’s transitions and rendering appear a bit slower compared. It has minimal body lags or delays concerning processing, significantly in action-dense areas or transitioning frames.
These observations indicate that the R1 0528 replace maybe consists of enhancements to efficiency and visible constancy.
Characteristic | DeepSeek R1-0528 | DeepSeek R1 |
Encoding | Correctly encoded | Lacking length metadata |
Rendering Fluidity | Easy and responsive | Minor lags on body transitions |
Visible High quality | Crisper visuals | Barely gentle |
Responsiveness | Improved, particularly in UI updates | Average |
Job 2: Planning a Journey to India
Every mannequin can be tasked with curating an economical journey to India. They must embrace an itinerary, together with prices and rationale. This may take a look at their planning abilities, information of the true world, in addition to their means to create structured and actionable content material.
Enter Immediate: “Plan a 3-day journey to India centered on tech, meals, and historical past with the worth of bills talked about.“
Output by DeepSeek R1 0528 (truncated):
3-Day India Journey: Delhi & Agra – Tech, Meals & Historical past Journey
(Finances: Mid-Vary Solo Traveler) 💰 Whole Value: ~₹25,000–30,000 ($300–360 USD)
Costs cowl meals, transport, entry charges, and lodging. Flights not included.
Day 1: Delhi – Historical past & Meals
Focus: Mughal/Colonial Historical past + Road Meals
Lodging: Mid-range lodge (Karol Bagh/Aerocity) – ₹2,500/evening
- Morning (Historical past):
- Pink Fort (UNESCO Web site): Discover Mughal structure. (Entry: ₹500)
- Jama Masjid: India’s largest mosque. (Free entry; ₹300 for tower entry)
- Lunch (Meals): Karim’s (Outdated Delhi) – Iconic Mughlai kebabs & biryani. (₹400)
- Afternoon (Historical past):
- Raj Ghat: Gandhi’s memorial. (Free)
- Humayun’s Tomb (UNESCO Web site): Pre-Taj Mughal marvel. (Entry: ₹600)
- Dinner (Meals): Paranthe Wali Gali – Fried breads with fillings. (₹200)
- Night: Stroll by means of Chandni Chowk market.
Every day Value: ₹4,500 (lodge, meals, transport, entries)
Output by Deepseek R1 (truncated):
Right here’s a well-balanced 3-day journey plan for India overlaying tech hubs, historic websites, and meals experiences, with estimated bills (in USD and INR for straightforward reference).
Day 1: Bangalore – The Silicon Valley of India
Theme: Tech + Fashionable Indian Meals
Morning: Tech Exploration
Go to Infosys Campus or Google Workplace (Outer View)– See India’s IT hub (no entry inside with out permission).
Indian Institute of Science (IISc) – Premier analysis institute (free entry, however verify visiting hours).
Visvesvaraya Industrial & Technological Museum – Interactive tech displays (Entry: $2 / ₹150).
Value: ~$5 / ₹400
Afternoon: Meals & Tradition
Lunch at Mavalli Tiffin Rooms (MTR) – Well-known for South Indian meals (~$8 / ₹600).
Industrial Road Procuring – Fast memento cease (non-obligatory).
Night: Fashionable Bangalore
UB Metropolis Mall – Excessive-end tech & luxurious manufacturers.
Dinner at Toit Brewpub – Craft beer & fusion meals (~$15 / ₹1,200).
Whole Day 1 Value: ~$30 / ₹2,500
You may view the entire output of each fashions right here.
Output Comparability
Each fashions present robust abilities in planning and producing structured content material. Nevertheless, there are variations within the model and depth of their journey planning.
DeepSeek R1 0528 plans a extra cohesive journey expertise utilizing a centered location (Delhi-Agra) to reduce journey overhead. On the similar time, it provides richness with experiences like tech-themed meals, marble workshops, and interpretations of up to date and trendy historical past round culinary traditions.
DeepSeek R1 covers a couple of main cities – Bangalore, Delhi, and Hyderabad, which definitely offers a breadth of publicity. Nevertheless, it comes at a a lot bigger price, making it much less viable for this specific use case.
Side | DeepSeek R1-0528 | DeepSeek R1 |
Itinerary Scope | Focuses on Delhi & Agra – compact, well-paced | Covers 3 cities (Bangalore, Delhi, Hyderabad) – broad however rushed |
Value Effectivity | ₹25,000–30,000 ($300–360 USD) – optimized journey & eating | ₹40,000–50,000 ($480–640 USD) – because of flights and luxurious eating |
Practicality | Larger – single base keep, quick day journey to Agra | Decrease – intercity journey is demanding in 3 days |
Expertise High quality | Immersive, themed days with cultural and trendy insights | Generic tech/historical past protection with much less depth |
Job 3: Logical Reasoning Job
Each fashions can be introduced with a logic puzzle in Physics so we are able to assess their problem-solving abilities, deductive reasoning abilities, and readability of explanations.
Enter Immediate: “You could have 3 mild bulbs in a room and three switches exterior. How do you establish which change controls which bulb in case you can solely enter the room as soon as?”
Output by DeepSeek R1 0528:
To find out which change controls which mild bulb with just one entry into the room, observe these steps:
- Label the three switches exterior the room as Change 1, Change 2, and Change 3 for reference.
- Activate Change 1 and go away it on for about 5 minutes.
- After 5 minutes, flip off Change 1 and instantly activate Change 2.
- Enter the room proper after turning on Change 2 and observe the state of every bulb:
- If a bulb is on, it’s managed by Change 2.
- If a bulb is off however heat to the contact, it’s managed by Change 1 (because it was on lengthy sufficient to warmth up however is now off).
- If a bulb is off and chilly to the contact, it’s managed by Change 3 (because it was by no means turned on).
Why this works:
- Through the use of the time delay with Change 1, you enable the bulb to warmth up. Whenever you flip it off, it stays heat for a brief interval.
- Turning on Change 2 instantly earlier than getting into ensures that its bulb is on.
- Change 3 stays off, so its bulb is off and chilly.
- This methodology leverages each the on/off state and the temperature of the bulb as indicators, permitting you to establish all three with a single entry.
Word: This resolution assumes the sunshine bulbs are incandescent and generate noticeable warmth when on. If the bulbs are LED or one other sort that doesn’t produce warmth, this methodology might not work, however conventional puzzles assume heat-generating bulbs.
Output by DeepSeek R1:
The Traditional Mild Bulb Puzzle: Resolution
Drawback Assertion:
You could have 3 switches exterior a room and three bulbs contained in the room. You may flip the switches on or off, however you’ll be able to solely enter the room as soon as to examine the bulbs. How do you establish which change controls which bulb?
Step-by-Step Resolution:
- Label the switches for readability (e.g., Change A, Change B, Change C).
- Activate Change A and go away it on for about 5 minutes.
– This length permits the bulb managed by Change A to warmth up (incandescent bulbs heat up noticeably).
- After 5 minutes, flip off Change A and instantly activate Change B.
– Now:
– Change A: Off (however bulb is heat)
– Change B: On (bulb is lit)
– Change C: Off (bulb is chilly and off)
- Enter the room to watch the bulbs:
– Bulb that’s ON → Managed by Change B (because it’s at present receiving energy).
– Bulb that’s OFF however WARM → Managed by Change A (was lately on and heated up).
– Bulb that’s OFF and COLD → Managed by Change C (was by no means turned on).
Key Perception:
- Makes use of warmth (from the bulb being on) and lightweight (present state) as indicators.
- Works solely with incandescent bulbs (not LEDs or CFLs, which don’t warmth up as noticeably).
Abstract Desk:
Bulb State | Controlling Change |
On | Change B |
Off however Heat | Change A |
Off and Chilly | Change C |
This methodology reliably maps all 3 switches to their bulbs in a single room entry.
Output Comparability
Each fashions precisely establish the puzzle resolution utilizing the usual method of utilizing warmth and lightweight to get rid of switches. Nevertheless:
DeepSeek R1 0528 offers a extra concise, readable, and environment friendly reply, the place it highlights sensible steps and features a useful word about bulb sort compatibility.
DeepSeek R1 presents a extra educational, formal group of textual content with useful headings, a abstract desk, and barely extra elaboration on all the choice steps.
Side | DeepSeek R1 | DeepSeek R1-0528 |
Correctness | Appropriate and full | Appropriate and full |
Readability of Rationalization | Barely verbose with a proper tone | Extra concise and user-friendly |
Construction & Format | Makes use of headers, markdown, and a abstract desk | Bullet-pointed, sequential; ends with a rationale paragraph |
Additional Perception | Highlights bulb varieties and situation assumptions | Provides word about LED vs incandescent compatibility |
End result Overview
DeepSeek R1 0528 confirmed important efficiency enhancements in comparison with R1 throughout all duties. In journey planning, R1 0528 offered a extra helpful cost-effective, and arranged itinerary. The video output evidenced a smoother and extra polished output that possible implies improved rendering. Within the logic puzzle exercise, each solved the duty appropriately, nevertheless, R1 0528 additionally gave a extra temporary and pure description. General, R1 0528 is extra usable, versatile/ related, and grounded for day-to-day duties.
Conclusion
DeepSeek R1 0528 is a substantial development within the democratization of superior AI applied sciences. It combines state-of-the-art efficiency with open-source availability to problem proprietary fashions with its compatibility and ease of use, whereas permitting extra people to leverage cutting-edge AI for analysis, growth, and enterprise use. Whether or not you’re a developer who desires to construct clever purposes, a researcher desirous to discover novel AI frontiers, or a enterprise wanting cost-effective and modern resolution suppliers, DeepSeek R1 0528 is a uniquely centered and succesful platform for the long run.
Login to proceed studying and revel in expert-curated content material.