LLM News and Articles
| Wednesday, 2025-12-24 | ||||
| 17:33 | Why LLMs Are Not Planning Machines (and why getting “good plans” is not the same as making good… https://medium.com/@arnab.c/why-llms-are-not-planning-machines-and-why-getting-good-plans-is-not-the-same-as-making-good-99b91f805560 | |||
| 17:30 | Converting Fine-Tuned LoRA LLMs for iOS & Android Inference Using MediaPipe https://medium.com/@unique.vageesh/converting-fine-tuned-lora-llms-for-ios-android-inference-using-mediapipe-857e7d48fc44 | |||
| 17:21 | Building a GraphRAG System for Civic Information Retrieval https://medium.com/@vinays.6360/building-a-graphrag-system-for-civic-information-retrieval-be9d5e6e54c0 | |||
| 16:42 | Beyond Bigger Context: Apple’s CLaRa Proposes a New Path for RAG https://medium.com/write-a-catalyst/beyond-bigger-context-apples-clara-proposes-a-new-path-for-rag-5e917bac7923 | |||
| 16:31 | OCR Isn’t Just About Reading Text — Insights from the DeepSeek OCR Research Paper https://medium.com/@visnus12a22223/ocr-isnt-just-about-reading-text-insights-from-the-deepseek-ocr-research-paper-dc63ae2861a4 | |||
| 16:23 | The Shift From AI Services to AI Infrastructure: How Companies Are Becoming Model Hosts https://ai.plainenglish.io/the-shift-from-ai-services-to-ai-infrastructure-how-companies-are-becoming-model-hosts-9ff85dcea24e | |||
| 16:06 | 2025 Retrospective: How AI Changed the Way I Engineer https://medium.com/data-engineering-space/2025-retrospective-how-ai-changed-the-way-i-engineer-c1deb44b2101 | |||
| 16:03 | Why LLMs Aren’t Enough: The Need for Orchestration with LangChain https://medium.com/@nagireddybv5/why-llms-arent-enough-the-need-for-orchestration-with-langchain-ba4b91058645 | |||
| 16:02 | 10 Cache Layers That Make RAG Feel Instant https://medium.com/@Praxen/10-cache-layers-that-make-rag-feel-instant-42a9c360a1dc | |||
| 15:58 | Building a Human-in-the-Loop Security Automation with Jira, LLMs, and AWS WAF https://medium.com/@kvanurag39/building-a-human-in-the-loop-security-automation-with-jira-llms-and-aws-waf-525647ea0cb1 | |||
| 15:44 | AWS re:Invent 2025 Recap and Why Key Announcements Matter https://sanjmo.medium.com/amazon-re-invent-2025-recap-and-why-key-announcements-matter-8ae31b734eb9 | |||
| 15:12 | Small Language Models: The Efficient Revolution in AI https://medium.com/@himansusaha/small-language-models-the-efficient-revolution-in-ai-66f04c33cc61 | |||
| 15:12 | How I Built a Postman Bot That Detects Breaking API Changes Before Deploy using LLM https://skakarh.medium.com/how-i-built-a-postman-bot-that-detects-breaking-api-changes-before-deploy-using-llm-e3d66f9f20d4 | |||
| 15:06 | The Ghost in the Matrix: Exploring the Quantum State of Large Language Models https://medium.com/@lilaroka.1199/the-ghost-in-the-matrix-exploring-the-quantum-state-of-large-language-models-93213478a4af | |||
| 15:02 | Top 7 Budget-Capped Orchestration Playbooks for Agents https://medium.com/@bhagyarana80/top-7-budget-capped-orchestration-playbooks-for-agents-18950d873aba | |||
| 15:02 | LAI #107: How AI Learns, Why It Feels Intelligent, and Where the Illusion Breaks https://pub.towardsai.net/lai-107-how-ai-learns-why-it-feels-intelligent-and-where-the-illusion-breaks-0fbf248c8362 | |||
| 14:55 | The Evolution of Reasoning in Language Models https://krayush.medium.com/the-evolution-of-reasoning-in-language-models-d30a3dedb59e | |||
| 14:52 | LLM Training: Chaos Behind Polished Output https://medium.com/@natekali/llm-training-chaos-behind-polished-output-2e8db29711af | |||
| 14:46 | Why Special Tokens Matter in LLMs: From Chat Formatting to Cutting-Edge Control https://medium.com/@nisarg.nargund/why-special-tokens-matter-in-llms-from-chat-formatting-to-cutting-edge-control-177273632055 | |||
| 14:39 | Making LLM Benchmarking Boring (In the Best Way) https://blog.gopenai.com/making-llm-benchmarking-boring-in-the-best-way-9dff06e31ac1 | |||
| 13:17 | Animated LLM – Understand the Mechanics of LLMs https://animatedllm.github.io/ | |||
| 13:13 | How do Large Language Models Work https://medium.com/@arnavk1802/how-do-large-language-models-work-cb94b5c173f2 | |||
| 13:11 | The Architecture Behind Modern LLMs: Transformers, Attention, KV Cache & Scaling https://noncodersuccess.medium.com/the-architecture-behind-modern-llms-transformers-attention-kv-cache-scaling-20e4ba91779a | |||
| 12:46 | Building a Character-Level Language Model for Indian Names: Lessons in Smoothing https://medium.com/mathematics-and-machine-learning/building-a-character-level-language-model-for-indian-names-lessons-in-smoothing-3753e67b1781 | |||
| 12:40 | How Can AI Eliminate Response Bias in Customer Satisfaction Scores? https://medium.com/@max.s_33396/how-can-ai-eliminate-response-bias-in-customer-satisfaction-scores-bc742ebc2db3 | |||
| 12:38 | Engineering Memory for AI Agents: A Practical Guide https://medium.com/@sahin.samia/engineering-memory-for-ai-agents-a-practical-guide-115a8966e673 | |||
| 12:34 | The Hidden Hero: How Tokenization Shapes AI Language Models https://medium.com/@sprinklr.ai/the-hidden-hero-how-tokenization-shapes-ai-language-models-908cd18f83fb | |||
| 12:34 | Why LLMs Train on 45TB Data: Shocking NLP Stats https://medium.com/@vikramlingam/why-llms-train-on-45tb-data-shocking-nlp-stats-b25338edbb74 | |||
| 12:12 | I Thought My NLP Training Was Obsolete in the LLM era. I Was Wrong. https://medium.com/@tahaymerghani/i-thought-my-nlp-training-was-obsolete-in-the-llm-era-i-was-wrong-c4be804d9f69 | |||
| 12:02 | Model Context Protocol (MCP) Explained: Definition, Architecture, and How it Actually Works? https://pub.towardsai.net/model-context-protocol-mcp-explained-definition-architecture-and-how-it-actually-works-58b9d08c98b5 | |||
| 11:06 | Transformers & LLMs — Part 7: Pre-training at Scale and Training Optimizations https://medium.com/@ashishbodla/transformers-llms-part-7-pre-training-at-scale-and-training-optimizations-6bbcbb5b9c31 | |||
| 10:31 | Microsoft’s Wild Bet: Ditch All C++ for Rust by 2030? https://medium.com/coding-nexus/microsofts-wild-bet-ditch-all-c-for-rust-by-2030-51c93dc25957 | |||
| 10:11 | Meeting “Peachy”: Giving Google Gemini a Body with Hugging Face’s Reachy Mini https://medium.com/google-cloud/meeting-peachy-giving-google-gemini-a-body-with-hugging-faces-reachy-mini-24602e1ff78b | |||
| 10:07 | AI in 2025: A builder’s retrospective https://medium.com/@pranavsinghania08/ai-in-2025-a-builders-retrospective-1bb2292214bc | |||
| 10:04 | Upgrading My Local ChatGPT App: Embedding Explorer, Document Intelligence & Research Tools (100%… https://medium.com/@manojramoorthy/upgrading-my-local-chatgpt-app-embedding-explorer-document-intelligence-research-tools-100-ad7653e1a689 | |||
| 09:52 | AI and LLMs: The New Era of SEO https://medium.com/@zeeshanhaiderjhang01/ai-and-llms-the-new-era-of-seo-775871b9f144 | |||
| 09:33 | Everything I learned while building a Retrieval-Augmented Generation (RAG) system. https://medium.com/@yashrajojha/everything-i-learned-while-building-a-rag-system-b3d49da0f95d | |||
| 09:25 | Best Large Language Model (LLM) Courses \| at Visualpath https://medium.com/@kalyanvisualpath/best-large-language-model-llm-courses-at-visualpath-1d3aa2fae852 | |||
| 09:01 | Why Multiple AI Perspectives Beat a Single “Good” Answer https://medium.com/@nianjiniuniu/why-multiple-ai-perspectives-beat-a-single-good-answer-5d8c287b318d | |||
| 08:50 | RAG — PART 1 Introduction https://medium.com/@harsh_77214/rag-part-1-introduction-54eca5f98191 | |||
| 08:43 | Fine-Tuning Strategy for Speaker Recognition https://medium.com/@rahuldudi1349/fine-tuning-strategy-for-speaker-recognition-da65b6294505 | |||
| 08:37 | What are AI Agents https://medium.com/@nilekakavisandi/what-are-ai-agents-ebc9720a7399 | |||
| 08:31 | Future of AI Agents: 5 Foundational Features in WyseOS That Point to the https://medium.com/@EricZhang2015/future-of-ai-agents-5-foundational-features-in-wyseos-that-point-to-the-2b491b531408 | |||
| 08:21 | The new computing paradigm https://medium.com/@darmousseh/the-new-computing-paradigm-60f21ee12d54 | |||
| 08:03 | OCR and Automatic Information Extraction: Spectacular Progress… Up to a Ceiling… https://wolffmarc.medium.com/ocr-et-extraction-automatique-dinformation-des-progr%C3%A8s-spectaculaires-jusqu-%C3%A0-un-plafond-9efdd7c14853 | |||
| 08:01 | Stop Spamming Cloud LLMs for Simple Tasks: Leveraging Apple’s On-Device AI for iOS https://blog.simprasuite.com/stop-spamming-cloud-llms-for-ced64086ab54 | |||
| 07:47 | What DeepSeek V3.2 Does Differently on a Technical Level and Why It Matters https://medium.com/@cenghanbayram35/deepseek-v3-2nin-teknik-olarak-farkl%C4%B1-yapt%C4%B1%C4%9F%C4%B1-%C5%9Fey-ve-neden-%C3%B6nemli-oldu%C4%9Fu-04098fc419a5 | |||
| 07:22 | Thinking of Agent Context https://medium.com/@dataplusai/thinking-of-agent-context-1ac4335dc5eb | |||
| 07:17 | When Models Pick Sides: How AI Learns to Discriminate https://medium.com/latent-pulse/when-models-pick-sides-how-ai-learns-to-discriminate-73eb775271d5 | |||
| 07:16 | Inside NVIDIA Nemotron 3: Hybrid MoE Models Built for Multi-Agent AI https://arunaddagatla.medium.com/inside-nvidia-nemotron-3-hybrid-moe-models-built-for-multi-agent-ai-2aabcc056e93 | |||
| 06:58 | Notes on GenAI/LLM https://medium.com/@ianchenmu/notes-on-genai-llm-e49a288ed31c | |||
| 06:32 | The Stateless Reality: Context in a Single Shot https://medium.com/@dan_collins/the-stateless-reality-context-in-a-single-shot-4bfee7dd633b | |||
| 06:20 | Building Production-Ready LLM Systems in 2025: The Strategic Tech Stack https://medium.com/@eng.fadishaar/building-production-ready-llm-systems-in-2025-the-strategic-tech-stack-00208a89206e | |||
| 06:06 | Yann LeCun’s Advice To AI Students And The Growing Divide On AGI. https://medium.com/@tech__manas/yann-lecuns-advice-to-ai-students-and-the-growing-divide-on-agi-034212f7d168 | |||
| 05:48 | TRANSFORMER ATTENTION PERCEPTION https://medium.com/@krishnaprasath42213/transformer-attention-perception-aa249f4a6bb5 | |||
| 05:36 | From Prompt Engineering to Context Engineering https://medium.com/@rahulbhalley/from-prompt-engineering-to-context-engineering-53e216557da4 | |||
| 05:32 | LLM Red-Team-in-a-Box: Prompt Injection, Data Exfil, and Safe-by-Default Middleware https://medium.com/@2nick2patel2/llm-red-team-in-a-box-prompt-injection-data-exfil-and-safe-by-default-middleware-e00cee361446 | |||
| 05:15 | Maincoder-1B – an open 1B-parameter coding model with 76% HumanEval https://huggingface.co/Maincode/Maincoder-1B | |||
| 04:29 | Stop Chasing JSON: Making LLM Outputs Type-Safe in TypeScript https://levelup.gitconnected.com/stop-chasing-json-making-llm-outputs-type-safe-in-typescript-7e121f427bf3 | |||
| 04:23 | Private LLMs vs Open-Source Models: How to Choose the Right One? https://marutitech.medium.com/private-llms-vs-open-source-models-5851a973137e | |||
| 04:10 | Google Health AI Releases MedASR: a Conformer Based Medical Speech to Text Model for Clinical Dictation https://www.marktechpost.com/2025/12/23/google-health-ai-releases-medasr-a-conformer-based-medical-speech-to-text-model-for-clinical-dictation/ | |||
| 04:02 | Your RAG System Is Making Up Facts Right Now https://medium.com/@mdfadil/your-rag-system-is-making-up-facts-right-now-c73e6bc44cdb | |||
| 04:00 | Decoding Memorization in Diffusion Models: Breaking down the Best NeurIPS’25 paper https://medium.com/@kakadechaitanya77/decoding-memorization-in-diffusion-models-breaking-down-the-best-neurips25-paper-52544dc28ac6 | |||
| 03:42 | This 20-Minute n8n Workflow Runs My Entire Side Hustle While I Sleep https://medium.com/@AThoughtbySnehal/this-20-minute-n8n-workflow-runs-my-entire-side-hustle-while-i-sleep-b44664f318d0 | |||
| 03:41 | SEO, AEO, GEO, and LLMO Explained: The Complete Guide to Modern Search Optimization https://neel1701.medium.com/seo-aeo-geo-and-llmo-explained-the-complete-guide-to-modern-search-optimization-d4b26f706af8 | |||
| 03:12 | Poetiq achieves 75% at under / problem using GPT-5.2 X-High on ARC-AGI-2 https://poetiq.ai/posts/arcagi_announcement/ | |||
| 03:07 | How to Become AGI https://medium.com/@zichengxu/how-to-become-agi-a5b2d7d74bda | |||
| 02:52 | How to Build a Scalable Information Extraction System (Without Losing Your Mind) https://medium.com/top-python-libraries/how-to-build-a-scalable-information-extraction-system-without-losing-your-mind-5dd76cfe8f33 | |||
| 02:46 | I asked LLMs to analyze some of our favorite companies pitchdecks https://medium.com/@samgivian2015/i-asked-llms-to-analyze-some-of-our-favorite-companies-pitchdecks-cf88b42c0e89 | |||
| 02:38 | Gemma Scope 2: A Microscope for Understanding Large Language Models https://medium.com/coding-nexus/gemma-scope-2-a-microscope-for-understanding-large-language-models-09c7b14e7877 | |||
| 02:25 | I Will (Most-Frequently) Come Back to Medium Within 2026. https://medium.com/@boatchrnthn/i-will-most-frequently-come-back-to-medium-within-2026-b924b689e250 | |||
| 02:03 | The Missed Call from the Future https://medium.com/coding-nexus/the-missed-call-from-the-future-7e95b008dab6 | |||
| 02:03 | Linear Regression: @ https://medium.com/@yashkamde19/linear-regression-bff95588d8f8 | |||
| 01:56 | What the hell does an LLM actually do? https://medium.com/@adhirajtiwari0307/what-the-hell-does-an-llm-actually-do-83e84e4c6896 | |||
| 00:31 | Choosing the Right LLM for Cognee: Local Ollama Setup https://medium.com/@rosgluk/choosing-the-right-llm-for-cognee-local-ollama-setup-bc257fc4fa58 | |||
| 00:20 | Learning JAX by Building Flexible Transformer Attention Masks: From Causal to Prefix-LM https://medium.com/@zdj0712/learning-jax-by-building-flexible-transformer-attention-masks-from-causal-to-prefix-lm-de1edafe2868 | |||
| 00:10 | Gemini Has “Severe Anxiety”? Even AI Can’t Handle Corporate Vibes Anymore https://medium.com/@jamesmiller22871/gemini-has-severe-anxiety-even-ai-cant-handle-corporate-vibes-anymore-56c0153d57eb | |||
| 00:00 | Open ended, continual learning are well on their way to being solved: Reflections from NeurIPS 2025 https://medium.com/@sunchipsster/open-ended-continual-learning-are-well-on-their-way-to-being-solved-reflections-from-neurips-2025-ad618fe39e7f | |||
| Tuesday, 2025-12-23 | ||||
| 23:58 | Your AI Is Snitching on You (And You’re Helping It) https://medium.com/@aiservices.review/your-ai-is-snitching-on-you-and-youre-helping-it-63f8dad9b81f | |||
| 23:54 | What Claude Does When the Conversation Never Ends: Emergent Behavior When an AI Is Given Freedom… https://georgesalapa.medium.com/what-claude-does-when-the-conversation-never-ends-emergent-behavior-when-an-ai-is-given-freedom-36e5f45aa86d | |||
| 23:25 | Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care… https://medium.com/@bader.tony/applications-and-concerns-of-chatgpt-and-other-conversational-large-language-models-in-health-care-3d6fa7cb1a99 | |||
| 22:53 | Top 10 AI Testing Tools You Need to Know in 2026 https://medium.com/@techlatest.net/top-10-ai-testing-tools-you-need-to-know-in-2026-9cfc6e940edf | |||
| 22:35 | The breakthrough of Large Language Models: How transformers have revolutionized AI https://medium.com/@isangmin0503/the-breakthrough-of-large-language-models-how-transformers-have-revolutionized-ai-d93de955742e | |||
| 22:12 | AI Tools Shaping Scientific Research in 2026 https://medium.com/@bishakhghosh0/ai-tools-shaping-scientific-research-in-2026-1a98229fd50b | |||
| 22:05 | The Hidden Event Problem: Why My AI Agent Kept Losing Its Memory (And How I Fixed It) https://medium.com/@vigneshvar.a.s/the-hidden-event-problem-why-my-ai-agent-kept-losing-its-memory-and-how-i-fixed-it-b9ddb0452f26 | |||
| 21:41 | Stop using temperature 1.0 https://medium.com/@glanzz/stop-using-temperature-1-0-385cb51ac863 | |||
| 21:30 | The Evolution of Context Engineering: From Prompt Hacking to Cognitive Architectures https://medium.com/data-science-collective/the-evolution-of-context-engineering-from-prompt-hacking-to-cognitive-architectures-14eb17243ef5 | |||
| 21:14 | Local AI is a pipe dream https://suryakasturi.medium.com/local-ai-is-a-pipe-dream-de836fff42bf | |||
| 21:07 | AI Quality Engineer — Newsletter https://medium.com/ai-in-quality-assurance/ai-quality-engineer-newsletter-f9cd9fe0b389 | |||
| 20:54 | DeepSeek V3.2: How an Open-Source Model Is Quietly Catching Up to GPT-5 https://medium.com/@sohilkhan.de1206/deepseek-v3-2-how-an-open-source-model-is-quietly-catching-up-to-gpt-5-9ac7e80446c6 | |||
| 20:46 | AI Systems Engineering: Building Deterministic Applications on Top of Probabilistic Chaos https://medium.com/@emanuel.junior.dev/engenharia-de-sistemas-de-ia-construindo-aplica%C3%A7%C3%B5es-determin%C3%ADsticas-sobre-o-caos-probabil%C3%ADstico-b473cd32d71c | |||
| 20:43 | Show HN: TypeScript template for building ChatGPT Apps https://github.com/pomerium/chatgpt-app-typescript-template | |||
| 20:38 | GEO is Not Geography: The Specificity Tax, The Ghost Bakery, and the New Rules of AI Search https://torikrzyy.medium.com/geo-is-not-geography-the-specificity-tax-the-ghost-bakery-and-the-new-rules-of-ai-search-1912bd567389 | |||
| 20:02 | Your AI Shouldn’t Do Math: 5 Lessons From Building a Financial Analyst locally on my Laptop https://pub.towardsai.net/your-ai-shouldnt-do-math-5-lessons-from-building-a-financial-analyst-locally-on-my-laptop-a90f5b43b7d7 | |||
| 19:57 | Improving Query Understanding and Document Retrieval in Search Engines Using BERT and Large… https://medium.com/@bangyi.yang.dev/improving-query-understanding-and-document-retrieval-in-search-engines-using-bert-and-large-55c866b7a148 | |||
| 19:31 | LLM Fine-Tuning Showdown: Full Fine-Tuning vs LoRA vs QLoRA — Which Method Should You Choose? https://medium.com/@birla2006/llm-fine-tuning-showdown-full-fine-tuning-vs-lora-vs-qlora-which-method-should-you-choose-b876c76ab86e | |||
| 19:10 | The Art of Agentic Rules: How to Architect a Project-Aware AI https://medium.com/@ayomideonibokun/the-art-of-agentic-rules-how-to-architect-a-project-aware-ai-da2863844219 | |||
| 19:09 | The Hidden Math Behind LLM Quantization: Why Float16 ≠ Float32 ≠ Int8 https://medium.com/@karthiklogan8/the-hidden-math-behind-llm-quantization-why-float16-float32-int8-b483271c7348 | |||
| 18:46 | An Analytical Review of MiniMax M2.1 https://medium.com/@leucopsis/an-analytical-review-of-minimax-m2-1-30eb5754b2d0 | |||
| 18:42 | Need $50K? Copy The 5 “Boring” AI Workers I Built That Make Money While You Sleep https://medium.com/@AThoughtbySnehal/need-50k-copy-the-5-boring-ai-workers-i-built-that-make-money-while-you-sleep-d38672e2fce1 | |||