Nhịp AI – Bản tin tuần 34

Nhịp AI tuần 34: Gemini dẫn đầu, GPT-5 bứt phá y học, DeepSeek mở chuẩn mã nguồn mở, chuẩn hóa MCP & POML, MIT báo 95% dự án GenAI thất bại.

Aug 24, 2025

📰 Tuần 34 - Tóm tắt
(từ 18/08/2025- 24/08/2025)

Tuần 34 ghi dấu những bước tiến dồn dập của trí tuệ nhân tạo: Gemini 2.5 Pro vượt lên dẫn đầu bảng xếp hạng LMArena, GPT-5 chứng minh ưu thế vượt trội trong y học, trong khi DeepSeek V3.1 đặt chuẩn mới cho mã nguồn mở với chi phí thấp hơn hàng chục lần. Song song đó, Google, Anthropic và Microsoft lần lượt công bố các giao thức và ngôn ngữ chuẩn hóa mới: “MCP (Anthropic), A2A (Google), POML (Microsoft), cho thấy xu hướng AI đang chuyển từ “đua benchmark” sang “thiết lập tiêu chuẩn tương tác”.

Các báo cáo nghiên cứu nhắc nhở thực tế: 95% dự án GenAI trong doanh nghiệp thất bại, nhưng những ứng dụng cụ thể như gaming hay tự động hóa back-office lại chứng minh giá trị rõ rệt. Thị trường vốn cũng sôi động với Anthropic đàm phán gọi vốn 10 tỷ USD, trong khi ByteDance và Alibaba tiếp tục đẩy mạnh thế trận mã nguồn mở.

Từ các giải đấu robot hình người tại Bắc Kinh cho đến sự ra mắt Pixel 10 với chip Tensor G5, tuần này phản ánh bức tranh AI vừa sôi động, vừa phức tạp: cạnh tranh mô hình khốc liệt, chuẩn hóa hạ tầng tăng tốc, thực tế triển khai nhiều va vấp, nhưng triển vọng ứng dụng ngày càng rõ ràng.

Các mô hình và công nghệ đột phá

Sự vượt trội của Gemini trong các bảng xếp hạng

Theo các báo cáo từ LMArena và thị trường dự đoán Kalshi, Gemini 2.5 Pro đã vươn lên vị trí đầu trong bảng xếp hạng LMArena. Điều đáng chú ý là 57% những người đặt cược trên Kalshi tin rằng Gemini sẽ trở thành mô hình AI văn bản tốt nhất vào cuối năm 2025, tăng từ 48.1% lên 57% chỉ trong 24 giờ, theo Cointribune vào ngày 19/8/2025 [1]. Trong khi đó, Claude tiếp tục dẫn đầu trong lĩnh vực lập trình.

GPT-5 và các tiến bộ y học

GPT-5 đạt được 95.84% độ chính xác trên bộ dữ liệu MedQA [5], thể hiện sự cải thiện đáng kể so với GPT-4o. Nghiên cứu cho thấy GPT-5 vượt trội hơn cả các chuyên gia y tế đã được cấp phép trong nhiều khía cạnh, đặc biệt là trong lập luận đa phương thức y học. Microsoft đã tích hợp GPT-5 vào toàn bộ hệ sinh thái sản phẩm bao gồm Microsoft 365 Copilot, GitHub Copilot, Visual Studio và Azure AI. [6]

DeepSeek V3.1 - Đột phá về lập trình mã nguồn mở

DeepSeek V3.1 đạt 71.6% điểm số trong bài kiểm tra lập trình Aider, vượt qua Claude Opus trong khi có chi phí thấp hơn 68 lần. Mô hình 685B tham số này hỗ trợ ngữ cảnh 128k token và được phát hành dưới giấy phép mã nguồn mở. [4] [8] [9]. Các thông số vẫn cần phải chờ các bài báo chính thức.

Công cụ và giao thức mới

Microsoft POML (Prompt Orchestration Markup Language)

Microsoft phát hành POML, một ngôn ngữ đánh dấu kiểu HTML/XML được thiết kế cho việc tạo prompt phức tạp cho LLM. POML cung cấp cấu trúc component-based, tích hợp dữ liệu liền mạch, và hệ thống styling tách biệt giống CSS. [13] [14] [15]

Google LangExtract

Google ra mắt thư viện Python LangExtract để trích xuất thông tin có cấu trúc từ văn bản không có cấu trúc. Thư viện này sử dụng các kỹ thuật tạo có kiểm soát để đảm bảo thông tin được trích xuất chính xác và liên kết với vị trí gốc trong văn bản. [16] [17] [18]

Mô hình âm thanh và hình ảnh tiên tiến

NVIDIA Canary-1b và Parakeet-tdt-0.6b

NVIDIA công bố các mô hình âm thanh Canary-1b và Parakeet-tdt-0.6b. Canary-1b-v2 hỗ trợ 25 ngôn ngữ châu Âu và có khả năng chuyển đổi giọng nói sang văn bản cũng như dịch thuật âm thanh. Parakeet TDT 0.6B tập trung vào tốc độ chuyển đổi âm thanh sang văn bản cực nhanh. [19]

Google Nano Banana

Google được cho phát triển "Nano Banana" - một mô hình chỉnh sửa hình ảnh AI được cho là vượt trội hơn GPT-4o về độ chân thực và đa dạng phong cách. Mô hình này cho phép chỉnh sửa hình ảnh bằng ngôn ngữ tự nhiên mà không cần các công cụ phức tạp. [20] [21] [22]. Tuy nhiên, đây là những thông tin chưa chưa chính thức, được đồn khá nhiều trên Social Networks trong những ngày qua.

AI-edited photo of a woman holding a designer handbag featuring a wave art print using Google Nano Banana.

Qwen-Image-Edit 20B

Alibaba phát hành Qwen-Image-Edit, mô hình chỉnh sửa hình ảnh 20B tham số hỗ trợ chỉnh sửa văn bản song ngữ (Trung-Anh) và các phép biến đổi ngữ nghĩa phức tạp. Mô hình sử dụng kiến trúc dual-path để kiểm soát ngữ nghĩa và diện mạo một cách độc lập. [23] [24] [25]

ElevenLabs Eleven v3 Alpha

ElevenLabs ra mắt Eleven v3 Alpha API hỗ trợ hơn 70 ngôn ngữ và khả năng tạo ra cuộc hội thoại đa nhân vật. API mới bao gồm chế độ Dialogue cho phép tạo ra các cuộc hội thoại tự nhiên với thay đổi giọng điệu và cảm xúc. [26] [27] [28] [29]

Nghiên cứu và báo cáo quan trọng

MIT: 95% dự án GenAI thất bại

Báo cáo từ MIT NANDA cho thấy 95% các dự án thí điểm GenAI trong doanh nghiệp thất bại. Nghiên cứu dựa trên 300 triển khai AI công khai và 150 cuộc phỏng vấn lãnh đạo ngành cho thấy chỉ 5% đạt được tăng trưởng doanh thu nhanh chóng. Tuy nhiên, AI cực kỳ thành công trong tự động hóa back-office. [30] [31] [32] [33]

90% nhà phát triển game sử dụng AI

Nghiên cứu của Google Cloud cho thấy 87% nhà phát triển trò chơi điện tử sử dụng AI agents. 94% người được khảo sát tin rằng AI sẽ giảm chi phí phát triển tổng thể trong dài hạn, mặc dù 25% thừa nhận khó đo lường ROI từ đầu tư AI. [34] [35] [36] [37] [38]

Anthropic huy động 10 tỷ USD

Anthropic đang đàm phán vòng gọi vốn 10 tỷ USD, tăng gấp đôi so với mục tiêu ban đầu 5 tỷ USD do nhu cầu cao từ các nhà đầu tư. Vòng gọi vốn này sẽ giúp định giá công ty lên khoảng 170 tỷ USD. [39] [40] [41] [42]

Sự kiện và triển lãm

World Humanoid Robot Games tại Bắc Kinh

Giải thi đấu robot hình người đầu tiên trên thế giới diễn ra tại Bắc Kinh với 500 robot từ 280 đội thuộc 16 quốc gia. Sự kiện 3 ngày bao gồm 26 môn thi từ bóng đá, boxing đến sắp xếp thuốc và làm sạch. [43] [44] [45] [46]

Một robot đã giành chiến thắng rõ ràng trước các đối thủ cạnh tranh nhưng lại chậm hơn đáng kể so với kỷ lục của con người. Nguồn: Tingshu Wang/REUTERS

Google Pixel 10 ra mắt

Google Pixel 10 được công bố vào ngày 20/8 với nhiều tính năng AI mới được hỗ trợ bởi chip Tensor G5 và mô hình Gemini Nano. Các tính năng nổi bật bao gồm Magic Cue (gợi ý thông tin chủ động), Camera Coach, và Voice Translate cho cuộc gọi thời gian thực. [47] [48] [49] [50]

Google released an updated version of its folding phone, the Pixel 10 Pro Fold, which will cost at least $1799. — Google đã phát hành phiên bản cập nhật của điện thoại màn hình gập Pixel 10 Pro Fold, có giá ít nhất là 1799 đô la. [48]

Mô hình mã nguồn mở quan trọng khác

ByteDance Seed-OSS-36B

ByteDance phát hành Seed-OSS-36B với ngữ cảnh 512K token - gấp đôi GPT-5 và gấp 4 lần các mô hình mã nguồn mở chính thống. Mô hình 36B tham số này được phát hành dưới giấy phép Apache-2.0 và có tính năng "thinking budget" để kiểm soát độ sâu lý luận. [51] [52] [53] [54]

Tổng kết

Tuần 34 cho thấy bức tranh AI toàn cầu vừa sôi động, vừa phức tạp: từ những đột phá mô hình như Gemini, GPT-5 hay DeepSeek, đến những bước tiến về tiêu chuẩn hóa (MCP, POML), rồi cả thực tế phũ phàng khi MIT nhấn mạnh 95% dự án GenAI doanh nghiệp thất bại. Song song đó, các ứng dụng thực tiễn trong gaming, back-office, cũng như làn sóng vốn đầu tư vào Anthropic hay ByteDance, khẳng định rằng AI không chỉ là câu chuyện công nghệ, mà là câu chuyện kinh tế, xã hội và chiến lược.

Nhìn lại, ta thấy ngành AI đang dịch chuyển từ cơn sốt benchmark sang thời kỳ của sự trưởng thành và tin cậy.

👉 Hẹn gặp lại bạn trong Nhịp AI – Bản tin tuần 35, chúng tôi sẽ tiếp tục mang đến cái nhìn toàn diện nhất về những chuyển động công nghệ, thị trường và tác động xã hội của AI.

Tham khảo

Các nguồn tham khảo theo định dạng IEEE:

[1] F. L., "57% of Kalshi bettors predict Gemini will become the best AI model in 2025," Cointribune, 19 Aug. 2025. [Online]. Available: https://www.cointribune.com/en/57-of-kalshi-bettors-predict-gemini-will-become-the-best-ai-model-in-2025/

[4] 36Kr Editorial Team, "DeepSeek V3.1 quietly released - The new AI programming benchmark model," 36Kr, 20 Aug. 2025. [Online]. Available: https://eu.36kr.com/en/p/3430524032372096

[5] A. Shapiro, "GPT-5 surpasses doctors in medical reasoning benchmarks," AI News, 13 Aug. 2025. [Online]. Available:

https://www.ainews.com/p/gpt-5-surpasses-doctors-in-medical-reasoning-benchmarks

[6] S. Wang, M. Hu, Q. Li, M. Safari, and X. Yang, "Capabilities of GPT-5 on multimodal medical reasoning," arXiv preprint arXiv:2508.08224, Aug. 2025. [Online]. Available: https://arxiv.org/pdf/2508.08224.pdf

[7] Vals.ai Team, "MedQA benchmark results - August 12, 2025," Vals.ai, 12 Aug. 2025. [Online]. Available: https://www.vals.ai/benchmarks/medqa-08-12-2025

[8] C. Zmilo, "DeepSeek V3.1 complete evaluation analysis: The new AI programming benchmark for 2025," Dev.to, 19 Aug. 2025. [Online]. Available: https://dev.to/czmilo/deepseek-v31-complete-evaluation-analysis-the-new-ai-programming-benchmark-for-2025-58jc

[9] VentureBeat Editorial Team, "DeepSeek V3.1 just dropped and it might be the most powerful open AI yet," VentureBeat, 19 Aug. 2025. [Online]. Available: https://venturebeat.com/ai/deepseek-v3-1-just-dropped-and-it-might-be-the-most-powerful-open-ai-yet/

[13] Turtles AI Team, "Microsoft's POML: The invisible thread that cleans prompt engineering," Turtles AI, Aug. 2025. [Online]. Available: https://www.turtlesai.com/en/pages-3105/microsoft_s_poml_the_invisible_thread_that_cleans

[14] A. Razzaq, "Microsoft releases POML (Prompt Orchestration Markup Language)," Marktechpost, 13 Aug. 2025. [Online]. Available: https://www.marktechpost.com/2025/08/13/microsoft-releases-poml-prompt-orchestration-markup-language/

[15] Y. Zhang, N. Chen, J. Xu, and Y. Yang, "Prompt Orchestration Markup Language," arXiv preprint arXiv:2508.13948v1, Aug. 2025. [Online]. Available: https://arxiv.org/html/2508.13948v1

[16] Telerik Team, "Step-by-step guide: Using LangExtract with OpenAI," Telerik Blogs, Aug. 2025. [Online]. Available: https://www.telerik.com/blogs/step-by-step-guide-using-langextract-openai

[17] AI Engineering Team, "Google released Python library for data extraction," AI Engineering Newsletter, Aug. 2025. [Online]. Available:

https://aiengineering.beehiiv.com/p/google-released-python-library-for-data-extraction

[18] D. Dominguez, "Google launches LangExtract: Python library for structured information extraction," InfoQ, Aug. 2025. [Online]. Available: https://www.infoq.com/news/2025/08/google-langextract-python/

[19] Ossels AI Team, "NVIDIA Canary 1B & Parakeet TDT 0.6B: Voice AI models revolutionizing speech recognition," Ossels AI, 15 Aug. 2025. [Online]. Available: https://ossels.ai/nvidia-canary-1b-parakeet-tdt-0-6b-voice-ai/

[20] MagicShot AI Team, "Meet Nano Banana: Google's smartest AI image editor yet," MagicShot AI, Aug. 2025. [Online]. Available: https://magicshot.ai/news/meet-nano-banana-googles-smartest-ai-image-editor-yet/

[21] Business Insider Team, "Bananas: Google's viral AI model goes mainstream," Business Insider, 21 Aug. 2025. [Online]. Available: https://www.businessinsider.com/bananas-google-viral-ai-model-2025-8

[22] FluxPro Team, "Meet Google Nano Banana: A game-changing AI image generator & editor," FluxPro Web, Aug. 2025. [Online]. Available: https://fluxproweb.com/blog/detail/Meet-Google-Nano-Banana-A-Game-Changing-AI-Image-Generator-Editor-be80db430603/

[23] Qwen Team, "Qwen-Image-Edit: Image editing with higher quality and efficiency," Qwen Blog, 19 Aug. 2025. [Online]. Available: https://qwenlm.github.io/blog/qwen-image-edit/

[24] Viblo Editorial, "Is Qwen-Image-Edit the 2025 breakthrough image-editing AI?" Viblo, Aug. 2025. [Online]. Available:

https://viblo.asia/p/is-qwen-image-edit-the-2025-breakthrough-image-editing-ai-gdJzvb1vJz5

[25] Marktechpost Team, "Qwen team introduces Qwen-Image-Edit: The image editing version of Qwen-Image with advanced capabilities," Marktechpost, 18 Aug. 2025. [Online]. Available: https://www.marktechpost.com/2025/08/18/qwen-team-introduces-qwen-image-edit-the-image-editing-version-of-qwen-image-with-advanced-capabilities-for-semantic-and-appearance-editing/

[26] AI Base News Team, "Qwen Image Edit: Revolutionary AI-powered image editing breakthrough," AI Base, 20 Aug. 2025. [Online]. Available: https://news.aibase.com/news/20693

[27] ElevenLabs Team, "Changelog: August 20, 2025 - ElevenLabs v3 Alpha release," ElevenLabs Documentation, 20 Aug. 2025. [Online]. Available: https://elevenlabs.io/docs/changelog/2025/8/20

[28] ElevenLabs Team, "Introducing Eleven v3 (alpha) - Our most expressive text to speech model," ElevenLabs, 2025. [Online]. Available: https://elevenlabs.io/v3

[29] ElevenLabs Team, "Eleven v3 (alpha) now available in the API," ElevenLabs Blog, 20 Aug. 2025. [Online]. Available: https://elevenlabs.io/blog/eleven-v3-alpha-now-available-in-the-api

[30] S. Estrada, "MIT report: 95% of generative AI pilots at companies are failing," Fortune, 18 Aug. 2025. [Online]. Available: https://fortune.com/2025/08/18/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo/

[31] Fortune Editorial Team, "An MIT report that 95% of AI pilots fail spooked investors, but the reason why those pilots failed is what should make the C-suite anxious," Fortune, 21 Aug. 2025. [Online]. Available: https://fortune.com/2025/08/21/an-mit-report-that-95-of-ai-pilots-fail-spooked-investors-but-the-reason-why-those-pilots-failed-is-what-should-make-the-c-suite-anxious/

[32] Loris AI Team, "MIT study: 95% of AI projects fail in enterprise deployment," Loris AI Blog, Aug. 2025. [Online]. Available: https://loris.ai/blog/mit-study-95-of-ai-projects-fail/

[33] Tech.co Editorial, "MIT: Enterprise AI pilots fail to deliver revenues," Tech.co, Aug. 2025. [Online]. Available: https://tech.co/news/mit-enterprise-ai-pilots-fail-revenues

[34] Jordan News Team, "Study: Nearly 90% of video game developers use artificial intelligence," Jordan News, 18 Aug. 2025. [Online]. Available: https://www.jordannews.jo/Section-129/Technology/Study-Nearly-90-of-Video-Game-Developers-Use-Artificial-Intelligence-44221

[35] Times of Games Team, "90% gaming developers use AI: Google study reveals industry transformation," Times of Games, Aug. 2025. [Online]. Available: https://www.timesofgames.com/news/90-gaming-developers-use-ai-google-study/

[36] Silicon UK Team, "Game developers embrace AI agents for workflow automation," Silicon UK, Aug. 2025. [Online]. Available: https://www.silicon.co.uk/e-innovation/artificial-intelligence/game-developer-ai-626380

[37] Cybernews Team, "Game developers turn to AI as industry transformation accelerates," Cybernews, 18 Aug. 2025. [Online]. Available: https://cybernews.com/ai-news/game-developers-ai-google-cloud/

[38] Reuters Team, "Nearly 90% of videogame developers use AI agents, Google study shows," Reuters, 18 Aug. 2025. [Online]. Available: https://www.reuters.com/business/nearly-90-videogame-developers-use-ai-agents-google-study-shows-2025-08-18/

[39] Bloomberg News, "Anthropic in talks to raise up to $10 billion in new funding," Bloomberg, 21 Aug. 2025. [Online]. Available: https://www.bloomberg.com/news/articles/2025-08-21/anthropic-in-talks-to-raise-up-to-10-billion-in-new-funding

[40] Techzine EU Team, "Anthropic aims to raise $10B for AI battle with OpenAI and Google," Techzine EU, 21 Aug. 2025. [Online]. Available: https://www.techzine.eu/news/applications/134003/anthropic-aims-to-raise-10b-for-ai-battle-with-openai-and-google/

[41] LinkedIn News, "Anthropic now eyeing $10B fundraise amid AI competition intensification," LinkedIn News, 21 Aug. 2025. [Online]. Available: https://www.linkedin.com/news/story/anthropic-now-eyeing-10b-fundraise-7035161/

[42] CoinCentral Team, "Investor demand doubles Anthropic's raise to $10B amid AI boom," CoinCentral, 22 Aug. 2025. [Online]. Available: https://coincentral.com/investor-demand-doubles-anthropics-raise-to-10b-amid-ai-boom/

[43] Deutsche Welle Team, "World's first humanoid robot games begin in China," DW News, 16 Aug. 2025. [Online]. Available: https://www.dw.com/en/worlds-first-humanoid-robot-games-begin-in-china/a-73652714

[44] Wikipedia Contributors, "World Humanoid Robot Games," Wikipedia, accessed Aug. 2025. [Online]. Available: https://en.wikipedia.org/wiki/World_Humanoid_Robot_Games

[45] CNBC Team, "World Humanoid Robot Games: China showcases Tesla Unitree competitors," CNBC, 18 Aug. 2025. [Online]. Available: https://www.cnbc.com/2025/08/18/world-humanoid-robot-games-china-tesla-unitree.html

[46] Smithsonian Magazine Team, "World's first robot Olympics features soccer, kickboxing and lots of falling down," Smithsonian Magazine, 19 Aug. 2025. [Online]. Available: https://www.smithsonianmag.com/smart-news/worlds-first-robot-olympics-features-soccer-kickboxing-and-lots-of-falling-down-180987199/

[47] Google Team, "Google Pixel 10: AI features and updates powered by Tensor G5," Google Blog, 20 Aug. 2025. [Online]. Available: https://blog.google/products/pixel/google-pixel-10-ai-features-updates/

[48] CNBC Team, "Google Pixel 10 series debuts with advanced Gemini AI integration," CNBC, 20 Aug. 2025. [Online]. Available: https://www.cnbc.com/2025/08/20/google-pixel-10-gemini-ai.html

[49] Wired Team, "All the new AI features in Google Pixel 10 phones," Wired, 20 Aug. 2025. [Online]. Available: https://www.wired.com/story/all-the-new-ai-features-in-google-pixel-10-phones/

[50] TechCrunch Team, "Google doubles down on AI phones with its Pixel 10 series," TechCrunch, 20 Aug. 2025. [Online]. Available: https://techcrunch.com/2025/08/20/google-doubles-down-on-ai-phones-with-its-pixel-10-series/

[51] N. K. Shankaran, "Game changer here: ByteDance's Seed-OSS 36B Instruct pushes AI boundaries," LinkedIn, Aug. 2025. [Online]. Available: https://www.linkedin.com/pulse/game-changer-here-bytedances-seed-oss-36b-instruct-pushes-shankaran-hcjbc

[52] N. Kumar L., "China's AI checkmate: ByteDance drops silicon bomb with revolutionary model," LinkedIn, Aug. 2025. [Online]. Available: https://www.linkedin.com/pulse/chinas-ai-checkmate-bytedance-drops-silicon-bomb-nantha-kumar-l-xpbnc

[53] 36Kr Editorial Team, "ByteDance releases Seed-OSS: China's latest AI breakthrough model," 36Kr, Aug. 2025. [Online]. Available: https://eu.36kr.com/en/p/3431996374142339

Discussion about this post

Ready for more?