LLM News and Articles
| Tuesday, 2025-12-23 | ||||
| 18:40 | The RAG System Engineering Series: Part 4 — Safety, Governance & Optimization https://medium.com/@gouravsingh096/the-rag-system-engineering-series-part-4-safety-governance-optimization-d00d70775212 | |||
| 18:37 | Understanding LLMs and the LangChain Universe: From Models to Real Applications https://medium.com/@samratmadake21/understanding-llms-and-the-langchain-universe-from-models-to-real-applications-36cd703948e5 | |||
| 18:34 | LSP, Hooks, and Workflow Design: What Actually Differentiates AI Coding Tools https://blog.dataengineerthings.org/lsp-hooks-and-workflow-design-what-actually-differentiates-ai-coding-tools-288711fa563b | |||
| 18:29 | Why My AI Changed: On Persona, Tone, and Truth in Language https://generativeai.pub/why-my-ai-changed-on-persona-tone-and-truth-in-language-b7f437a419a6 | |||
| 18:22 | Local e-ink handwriting recognition with on-device VLMs https://borismus.medium.com/local-e-ink-handwriting-recognition-with-on-device-vlms-025cfcb52d28 | |||
| 18:16 | I used RL fine-tuning to make an LLM generate ugly and unpythonic FizzBuzz code https://seantey.github.io/sloppy-fizzbuzz-blog/sloppy_fizzbuzz_blog.html | |||
| 17:59 | Minimizing Hyperbolic Embedding Distortion with LLM-Guided Hierarchy Structuring https://arxiv.org/abs/2511.20679 | |||
| 17:59 | Storytelling Through Feature Engineering: Lossy Compression for Language Models https://medium.com/@mfbaig35r/storytelling-through-feature-engineering-lossy-compression-for-language-models-36004ce54c7f | |||
| 17:54 | They Don’t Know WTF Is Going On! https://stephwynne.medium.com/they-dont-know-wtf-is-going-on-5fd89c570cc1 | |||
| 17:36 | ✨ Starting 2026 with a builder’s mindset ✨ https://devopslearning.medium.com/starting-2026-with-a-builders-mindset-f24e3adddde5 | |||
| 17:16 | DeepSeek-V3 Python Tutorial: Fine-Tune an Open LLM Locally (Hugging Face + 8GB GPU) https://medium.com/@muruganantham52524/deepseek-v3-python-tutorial-fine-tune-an-open-llm-locally-hugging-face-8gb-gpu-85bf61ddc504 | |||
| 16:42 | The LLM’s Resource Layer https://medium.com/@jernej.klancic/the-llms-resource-layer-a863d195592b | |||
| 16:34 | MCP’s biggest impact https://medium.com/@jernej.klancic/mcps-biggest-impact-cc0d40d62096 | |||
| 16:29 | Turning Simple Chatbots into Smart, Self-Correcting Agents: Understanding LangGraph in 5 Minutes https://medium.com/@AbhishekDatta22/turning-simple-chatbots-into-smart-self-correcting-agents-understanding-langgraph-in-5-minutes-3216549735fc | |||
| 16:29 | Turning Simple Chatbots into Smart, Self-Correcting Agents: Understanding LangGraph in 5 Minutes https://ai.plainenglish.io/turning-simple-chatbots-into-smart-self-correcting-agents-understanding-langgraph-in-5-minutes-3216549735fc | |||
| 16:14 | Fei-Fei Li’s Latest Interview: Skills Matter More Than Degrees in the AI Era https://medium.com/@breezen100/fei-fei-lis-latest-interview-skills-matter-more-than-degrees-in-the-ai-era-7b13663297fb | |||
| 15:52 | Continued Pre-Training (CPT), the Future of Fine-Tuning for Domain-Specific AI? https://medium.com/@maxwbuckley/continued-pre-training-cpt-the-future-of-fine-tuning-for-domain-specific-ai-ef03281e071a | |||
| 15:42 | LLM Inference Performance Benchmarking from Scratch https://phillippe.siclait.com/blog/llm-benchmarking-from-scratch | |||
| 15:38 | Tips & Tricks: Parallel Tool Calling in ADK https://medium.com/google-cloud/tips-tricks-parallel-tool-calling-in-adk-edc9eebf6954 | |||
| 15:09 | How to Build a Self-Healing RAG System with LangGraph to Detect Bad Retrieval, Rewrite Queries, and… https://medium.com/data-and-beyond/how-to-build-a-self-healing-rag-system-with-langgraph-to-detect-bad-retrieval-rewrite-queries-and-eba7246b3983 | |||
| 15:07 | Beyond Bias: The Hidden AI Risks Your Team Is Still Overlooking https://medium.com/data-science-collective/beyond-bias-the-hidden-ai-risks-your-team-is-still-overlooking-fb8cf9a9d8ab | |||
| 15:07 | TAI #184: Gemini 3 Flash is 3x Faster and 4x Cheaper than Pro and even wins on some benchmarks https://pub.towardsai.net/tai-184-gemini-3-flash-is-3x-faster-and-4x-cheaper-than-pro-and-even-wins-on-some-benchmarks-f97f27ef5db6 | |||
| 15:06 | The Inertia Trap: Why AI Assistants Remember Brands but Stop Choosing Them https://medium.com/@tim_62250/the-inertia-trap-why-ai-assistants-remember-brands-but-stop-choosing-them-6b686506e811 | |||
| 15:05 | Part I | Understanding the Engineering Roots of LLM Hallucinations https://luka-neurowatt.medium.com/part-i-understanding-the-engineering-roots-of-llm-hallucinations-1822ec0612d3 | |||
| 15:04 | DeepFEP:MOSS AGI Architecture https://tagtal.medium.com/deepfep-moss-agi-architecture-ac5b65352d5a | |||
| 15:02 | How an code editor decide the right moment to show an LLM-generated code suggestion https://medium.com/@marketing_39613/how-an-code-editor-decide-the-right-moment-to-show-an-llm-generated-code-suggestion-c5fc7677e011 | |||
| 14:45 | "Could ChatGPT Do This Overnight?" If Yes, Redesign It https://nickpotkalitsky.substack.com/p/could-chatgpt-do-this-overnight-if | |||
| 14:40 | Beyond RAG: How CLaRa Makes Retrieval and Generation Think in the Same Space https://medium.com/@naveritchev/beyond-rag-how-clara-makes-retrieval-and-generation-think-in-the-same-space-84303d8e147e | |||
| 14:07 | AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems https://huggingface.co/blog/ServiceNow-AI/aprielguard | |||
| 13:51 | Overfitting Explained Using Code (Not Theory) https://medium.com/@chauhanvishal9963/overfitting-explained-using-code-not-theory-96fbdf33b89e | |||
| 13:37 | LoPA: Scaling Diffusion LLM Single-Sample Throughput to 1000 TPS https://zhijie-group.github.io/blogs/lopa/ | |||
| 13:16 | She Fell in Love with ChatGPT. Then She Ghosted It https://www.nytimes.com/2025/12/22/business/media/cbs-news-bari-weiss-60-minutes.html | |||
| 12:42 | Building an Autonomous Supply Chain: An Interactive Guide to Multi-Agent Demand Planning with… https://medium.com/@sandyeep70/building-an-autonomous-supply-chain-an-interactive-guide-to-multi-agent-demand-planning-with-0aff6b9d67b6 | |||
| 12:32 | Python + FastAPI Microbatch Endpoints: LLM and Vector I/O That Survive Spikes https://medium.com/@hadiyolworld007/python-fastapi-microbatch-endpoints-llm-and-vector-i-o-that-survive-spikes-4df4dc018191 | |||
| 12:29 | Complete Digital Reputation & Visibility Solutions by Reputn https://medium.com/@support_23698/complete-digital-reputation-visibility-solutions-by-reputn-fc365562fcea | |||
| 12:04 | Upskilling for 2026: Why Agentic AI Training is Crucial for IT Professionals in Hyderabad https://medium.com/@agenticaimasters/upskilling-for-2026-why-agentic-ai-training-is-crucial-for-it-professionals-in-hyderabad-5e07c01623c2 | |||
| 12:02 | NeMo Guardrails: Putting the “Responsible” in AI https://daisuke1024akagawa.medium.com/nemo-guardrails-putting-the-responsible-in-ai-ef7e0bfffea0 | |||
| 11:54 | Stop Building “Agents.” Start Building a Skills Library (and an Agentic Operating Layer) https://abvcreative.medium.com/stop-building-agents-start-building-a-skills-library-and-an-agentic-operating-layer-0cc51f9bf1d6 | |||
| 11:48 | AI in the Classroom: A Powerful Tool or a Growing Academic Risk? https://medium.com/@saiswathipriyaveluri/ai-in-the-classroom-a-powerful-tool-or-a-growing-academic-risk-9f9e6fa9873f | |||
| 11:42 | The Thing That Severs the ‘Single Thread’ https://medium.com/@onlythequestioner/the-thing-that-severs-the-single-thread-57cc34433124 | |||
| 11:40 | How the World’s Youngest Self-Made Billionaire Built a .3B Empire—and How You Can Do It Too https://medium.com/domealo/how-the-worlds-youngest-self-made-billionaire-built-a-7-3b-empire-and-how-you-can-do-it-too-2731992ac6e7 | |||
| 11:35 | From Gherkin to Self-Healing Tests: Building an AI-Driven Playwright Framework https://medium.com/@niarsdet/from-gherkin-to-self-healing-tests-building-an-ai-driven-playwright-framework-07f0fc46f52d | |||
| 11:25 | Why Some AI Models Talk, and Others Actually Help https://medium.com/@coolmotu/why-some-ai-models-talk-and-others-actually-help-9de1a635fc0b | |||
| 11:21 | The Billion Question Nobody’s Asking About AI https://medium.com/@fkxjpmhtzym1688/the-40-billion-question-nobodys-asking-about-ai-be357eccec80 | |||
| 11:11 | Beyond the Hype: Six Surprising Truths Shaping the 2025 Open-Source AI Revolution https://rumeysakara.medium.com/beyond-the-hype-six-surprising-truths-shaping-the-2025-open-source-ai-revolution-c5d90a21a233 | |||
| 10:57 | Well Fusion https://medium.com/ai-but-make-it-intimate/well-fusion-7692f241eff3 | |||
| 10:00 | Pourquoi les LLM nous obligent à repenser l’observabilité logicielle https://medium.com/@paola.mauceri7/pourquoi-les-llm-nous-obligent-%C3%A0-repenser-lobservabilit%C3%A9-logicielle-0d2736e9462e | |||
| 09:52 | How I Built a Reliable Text-to-SQL AI Agent: A Step-by-Step Guide https://medium.com/@radhika20112000/how-i-built-a-reliable-text-to-sql-ai-agent-a-step-by-step-guide-7502e1959ff5 | |||
| 09:32 | AI Gateways Explained: The Infrastructure Layer Most AI Teams Discover Too Late https://life-of-utkarsh.medium.com/ai-gateways-explained-the-infrastructure-layer-most-ai-teams-discover-too-late-fc78f2df17b5 | |||
| 09:28 | The 3‑Stage Training Pipeline Behind ChatGPT, Claude, and Gemini (Explained Simply) https://pub.towardsai.net/the-3-stage-training-pipeline-behind-chatgpt-claude-and-gemini-explained-simply-bbd45b1f7368 | |||
| 09:24 | Your own private, personal AI, for free! https://medium.com/@advaithpramodaadi/your-own-private-personal-ai-for-free-ff9ab451b266 | |||
| 09:19 | The Future of AI Browsing: How Atlas Will Reshape SEO, SEM, and Content Discovery https://medium.com/@writtenlyhub./how-chatgpt-atlas-changes-marketing-43ddcd70b678 | |||
| 08:42 | Self-Hosted Agentic RAG: Your Personal AI Document Assistant That Keeps Your Data On Your Machine https://medium.com/@dmitriyloza/self-hosted-agentic-rag-your-personal-ai-document-assistant-that-keeps-your-data-on-your-machine-6a74719e58f0 | |||
| 08:01 | From Single QA to Block Masking: Lessons from Fine-Tuning an LLM for Noun Extraction https://medium.com/@jerrylikespython/from-single-qa-to-block-masking-lessons-from-fine-tuning-an-llm-for-noun-extraction-577054b90a7a | |||
| 07:47 | The Next Phase of AI Will Not Be Smarter: It Will Be Accountable https://medium.com/@tim_62250/the-next-phase-of-ai-will-not-be-smarter-it-will-be-accountable-8e6405d8d66a | |||
| 07:36 | Why AI models cant understand images clearly still today? https://medium.com/@suryasunrise261/why-ai-models-cant-understand-images-clearly-still-today-b32150da6a89 | |||
| 07:32 | 7 ML Quantization Wins (INT8/FP8) Without Freefall https://medium.com/@ThinkingLoop/7-ml-quantization-wins-int8-fp8-without-freefall-ac79357345e0 | |||
| 07:32 | 7 Retry & Timeout Policies for Flaky LangChain Tools https://medium.com/@bhagyarana80/7-retry-timeout-policies-for-flaky-langchain-tools-1cb637ab9d84 | |||
| 07:25 | Hybrid Testing: Real APIs, Fake Users, and AI in the Middle https://medium.com/@mmario.ffrohlich/hybrid-testing-real-apis-fake-users-and-ai-in-the-middle-ec90d5baf069 | |||
| 07:02 | LLM (LARGE LANGUAGE MODELS): NASIL “DÜŞÜNÜRLER” VE PROMPT NEDEN BU KADAR ÖNEMLİ? https://medium.com/@hsdkayseriuni/llm-large-language-models-nasil-d%C3%BC%C5%9F%C3%BCn%C3%BCrler-ve-prompt-neden-bu-kadar-%C3%B6nemli%CC%87-f10e78fc0b63 | |||
| 06:47 | How AI Customer Service Is Redefining Business Support in 2026 https://medium.com/@complereinfosystem827/how-ai-customer-service-is-redefining-business-support-in-2026-9dd54657a4d9 | |||
| 06:45 | The LLM is Now a Project Manager: Why Asynchronous Thinking is the Future of AI Reasoning https://harshchandekar10.medium.com/the-llm-is-now-a-project-manager-why-asynchronous-thinking-is-the-future-of-ai-reasoning-7616738b5ecd | |||
| 06:39 | Teaching Language Models to Stay Inside the Lines https://medium.com/@abivarma/teaching-language-models-to-stay-inside-the-lines-237515ae8a07 | |||
| 06:15 | The Complete Guide to Model Fine-Tuning https://medium.com/@sarthakpattanaik_4094/the-complete-guide-to-model-fine-tuning-1c8bb0699481 | |||
| 06:04 | The Engineering Guide to Industrial-Grade LLMOps-Part-2 https://medium.com/@tushitdavergtu/the-engineering-guide-to-industrial-grade-llmops-part-2-4dd5805d6d27 | |||
| 04:56 | Why Web Agents Fail — and How Semantic Geometry Helps Them Execute https://medium.com/@rcholic/why-web-agents-fail-and-how-semantic-geometry-helps-them-execute-24d5a6cb950d | |||
| 04:55 | Can AI discover something new that humans don’t know yet? https://medium.com/@amitsharmamad/can-ai-discover-something-new-that-humans-dont-know-yet-32f502d220d7 | |||
| 04:43 | Day 15: 21 Days of Building a Small Language Model: RMSNorm https://devopslearning.medium.com/day-15-21-days-of-building-a-small-language-model-rmsnorm-febd0364a0aa | |||
| 04:39 | Virtual-Meta Telemetry for No-Meta Agents: A Researcher-Focused Explanation https://medium.com/@omanyuk/virtual-meta-telemetry-for-no-meta-agents-a-researcher-focused-explanation-468425b275b0 | |||
| 04:39 | Google DeepMind Researchers Release Gemma Scope 2 as a Full Stack Interpretability Suite for Gemma 3 Models https://www.marktechpost.com/2025/12/22/google-deepmind-researchers-release-gemma-scope-2-as-a-full-stack-interpretability-suite-for-gemma-3-models/ | |||
| 04:02 | Your AI Is Lying to You — And Knowledge Graphs Are Why https://medium.com/@kankit570/your-ai-is-lying-to-you-and-knowledge-graphs-are-why-ff999d1215d2 | |||
| 04:00 | Bigger Models Didn’t Win. Better Retrieval Did — Advantages of Retrieval-Augumented Generation https://rahman-codes.medium.com/bigger-models-didnt-win-better-retrieval-did-advantages-of-retrieval-augumented-generation-dbfc86a4e3b6 | |||
| 04:00 | How Entrepreneurs Use The K Growth Consultant AI Prompt (And Growth Consultant Pro) For Scalable… https://medium.com/@ferreradaniel/how-entrepreneurs-use-the-20k-growth-consultant-ai-prompt-and-growth-consultant-pro-for-scalable-2b76b119dea1 | |||
| 03:32 | How Enterprises Are Cutting LLM Costs by 70% with Multi-Model Routing: A Complete Guide https://medium.com/@kaulsiddharth/how-enterprises-are-cutting-llm-costs-by-70-with-multi-model-routing-a-complete-guide-6d339fcf7585 | |||
| 03:32 | When Prompts Are Not Enough (Part 3) https://medium.com/@er.rajkumaar/when-prompts-are-not-enough-part-3-8dfccdcae2f0 | |||
| 03:28 | Turn Any Autoregressive LLM into a Diffusion Language Model (With Minimal Compute) https://medium.com/coding-nexus/turn-any-autoregressive-llm-into-a-diffusion-language-model-with-minimal-compute-9100653bca2e | |||
| 03:24 | Prompt Caching: Why Cached Tokens Are 10× Cheaper and Faster https://medium.com/coding-nexus/prompt-caching-why-cached-tokens-are-10-cheaper-and-faster-cf3c5cefd4c5 | |||
| 03:21 | Agentic AutoReel Factory: Turn Reddit Stories into Scroll-Stopping Shorts With AI https://gunjanvi.medium.com/agentic-autoreel-factory-turn-reddit-stories-into-scroll-stopping-shorts-with-ai-c7faf84f1ec7 | |||
| 02:50 | Building a Tool-Driven Multi-Agent Swarm with LangGraph: When AI Agents Work Together https://aws.plainenglish.io/building-a-tool-driven-multi-agent-swarm-with-langgraph-when-ai-agents-work-together-b699b7781bae | |||
| 02:40 | Intuition of Model Context Protocol(MCP) https://blog.dataengineerthings.org/intuition-of-model-context-protocol-mcp-fd5085624989 | |||
| 02:32 | When an LLM Trained in Space, AI Quietly Escaped Earth https://medium.com/@optimaoai/when-an-llm-trained-in-space-ai-quietly-escaped-earth-07c72b4770d3 | |||
| 02:17 | I/O-First Energy Reduction for LLM Inference & Training (IGSK: Enforceable I/O Budgets) https://medium.com/@omanyuk/i-o-first-energy-reduction-for-llm-inference-training-igsk-enforceable-i-o-budgets-dc9bbd7ba495 | |||
| 02:10 | Why Your Best Prompts Still Fail (And What Advanced AI Users Do Instead) https://medium.com/@basilpuglisi/why-your-best-prompts-still-fail-and-what-advanced-ai-users-do-instead-c997b34d2491 | |||
| 02:05 | [Part 1] Designing a Workflow-Oriented Architecture for Document Processing in 2025 https://medium.com/@brownsloth/part-1-designing-a-workflow-oriented-architecture-for-document-processing-in-2025-9c4ddbae5976 | |||
| 00:32 | Are Small Language Models the future? https://medium.com/@praneeth.yerrapragada/are-small-language-models-the-future-53318540039f | |||
| 00:26 | Google Gemini 3 Deep Research Course For Business Automation https://medium.com/@ferreradaniel/google-gemini-3-deep-research-course-for-business-automation-3eb16969485c | |||
| Monday, 2025-12-22 | ||||
| 23:59 | Interactions API + ADK: A Closer Look https://medium.com/@thegenaigirl/interactions-api-adk-a-closer-look-3fefbafa3350 | |||
| 23:52 | Let Them Sleep: Adaptive LLM
Agents via a Sleep Cycle https://mccraetech.medium.com/let-them-sleep-adaptive-llm-agents-via-a-sleep-cycle-60e26b0723ab | |||
| 23:30 | Unravel Agentic Business Automation with LLM Directive Orchestration Execution (RAW) https://medium.com/@bennyco/unravel-agentic-business-automation-with-llm-directive-orchestration-execution-raw-e89d5d03a0c6 | |||
| 23:12 | TiDAR Explained: What I Learned from a Model That Thinks in Diffusion and Talks in Autoregression https://medium.com/@myakalarajkumar1998/tidar-explained-what-i-learned-from-a-model-that-thinks-in-diffusion-and-talks-in-autoregression-06f564dcb4c6 | |||
| 22:56 | It’s Not Just One Bias: Inside the Intersectional Blind Spots of LLMs https://ai.plainenglish.io/its-not-just-one-bias-inside-the-intersectional-blind-spots-of-llms-6782bc739759 | |||
| 22:32 | The LLM design flaw I freaked out about is actually the intended design https://medium.com/@emeline.liu/the-llm-design-flaw-i-freaked-out-about-is-actually-the-intended-design-07df5339c776 | |||
| 22:26 | What I Learned from FlashEVA: Why Efficient Attention Matters More Than Bigger LLMs https://medium.com/@myakalarajkumar1998/what-i-learned-from-flasheva-why-efficient-attention-matters-more-than-bigger-llms-c237cfa35530 | |||
| 22:21 | [OpenAI] Monitoring Monitorability https://medium.com/@mdpman/openai-monitoring-monitorability-8116b665ac94 | |||
| 22:15 | All Data and AI Weekly #221–22Dec2025 https://medium.com/@tspann/all-data-and-ai-weekly-221-22dec2025-b9fc88391645 | |||
| 21:45 | Learning Arabic Online: What Actually Works (And What’s a Waste of Time) https://medium.com/@alphabtarabicacademy/learning-arabic-online-what-actually-works-and-whats-a-waste-of-time-fbc15d3bc4b2 | |||
| 21:15 | The Best Abliterated LLMs for Raw NSFW Storytelling in Late 2025 https://watsonout.medium.com/the-best-abliterated-llms-for-raw-nsfw-storytelling-in-late-2025-9fb72bbe5d79 | |||
| 21:10 | Complete MITRE ATT&CK MCP Server https://medium.com/@nsangouinoussa515/mitre-att-ck-mcp-server-ed811874dff0 | |||
| 20:55 | Past of Goal-Guided Conversational AI Models(5) https://createmomo.medium.com/past-of-goal-guided-conversational-ai-models-5-7ca82b2ce496 | |||
| 20:47 | Cutting AI Costs Starts With Better Prompts https://medium.com/write-a-catalyst/cutting-ai-costs-starts-with-better-prompts-c87d3921d951 | |||
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124