LLM News and Articles
| Tuesday, 2025-11-18 | ||||
| 18:14 | Building an Agentic AI Shoe Store Assistant Using Gemini & LiteLLM https://medium.com/@nkchauhan003/building-an-agentic-ai-shoe-store-assistant-using-gemini-litellm-a7c5d8535560 | |||
| 18:05 | When Machines Learn to Negotiate: Reimagining Manufacturing Coordination Through Multi-Agent AI https://medium.com/@jsmith0475/when-machines-learn-to-negotiate-reimagining-manufacturing-coordination-through-multi-agent-ai-fb6a60fc17eb | |||
| 16:48 | Bridging the Gap: How AI Agents & LLMs Connect to Your Data with RAG and MCP https://medium.com/@kstvkmrchanda2/bridging-the-gap-how-ai-agents-llms-connect-to-your-data-with-rag-and-mcp-4c3b58366c88 | |||
| 16:41 | ChatGPT vs Gemini vs Claude vs … https://medium.com/write-a-catalyst/chatgpt-vs-gemini-vs-claude-vs-a94510b5bd73 | |||
| 16:26 | Solving a million-step LLM task with zero errors https://arxiv.org/abs/2511.09030 | |||
| 16:26 | When Satire Met LinkedIn: The VSC Format That Wasn’t https://medium.com/@eric.njuguna.mail/when-satire-met-linkedin-the-vsc-format-that-wasnt-da5041ef3fff | |||
| 16:22 | AI That Understands Context Will Understand You — Part 1 https://medium.com/@glad_sinopia_alpaca_676/ai-that-understands-context-will-understand-you-part-1-32a6ec616c66 | |||
| 16:17 | Understanding Modern LLM Tools: A Comprehensive Guide for Everyday Users https://erharshraj.medium.com/understanding-modern-llm-tools-a-comprehensive-guide-for-everyday-users-059c18033a0d | |||
| 16:14 | How to Build a RAG System: A Simple 7-Step Implementation Guide for Developers https://medium.com/@wolfxense-ai/how-to-build-a-rag-system-a-simple-7-step-implementation-guide-for-developers-f8fc0532f4cc | |||
| 16:11 | The Ultimate 2025 AI Model Cheat Sheet: LLMs, SLMs, Multimodal, Open-Source, Cloud, and LPU… https://medium.com/@dewasheesh.rana/the-ultimate-2025-ai-model-cheat-sheet-llms-slms-multimodal-open-source-cloud-and-lpu-8d664dd8091c | |||
| 16:09 | Can computers/AI do mathematics? https://medium.com/@AIchats/can-computers-ai-do-mathematics-e744ae98fb6c | |||
| 16:05 | Grok 4.1 Launch: This Update Is Different https://ai-engineering-trend.medium.com/grok-4-1-launch-this-update-is-different-089e784a5ec1 | |||
| 16:02 | D3–6 Top 9 MoE Optimizations for Real-World SLAs https://medium.com/@ThinkingLoop/d3-6-top-9-moe-optimizations-for-real-world-slas-ea207fbaf3dc | |||
| 15:53 | Microsoft, Nvidia to invest up to $15B in Anthropic https://www.bloomberg.com/news/articles/2025-11-18/microsoft-nvidia-to-invest-up-to-15-billion-in-anthropic | |||
| 15:48 | The Drift Problem: Why AI Doesn’t Misperceive Reality, It Erodes It https://medium.com/@semanticfidelitylab/the-drift-problem-why-ai-doesnt-misperceive-reality-it-erodes-it-c3109df5418a | |||
| 15:42 | Microsoft and Anthropic Team Up https://www.youtube.com/watch | |||
| 15:39 | How to Build or Use AI reliably Without Guessing Prompts https://medium.com/coding-nexus/how-to-build-or-use-ai-reliably-without-guessing-prompts-1f44114c3139 | |||
| 15:36 | AI RPA = Fear factor. https://medium.com/@tyler_48883/ai-rpa-fear-factor-908705b579f4 | |||
| 15:32 | Why Small Language Models Are the Sleeper Trend of 2026 https://medium.com/@kacperwlodarczyk/why-small-language-models-are-the-sleeper-trend-of-2026-05624e87e67d | |||
| 15:31 | hallucinations are bad? what… Labeling things is easy. https://medium.com/@tyler_48883/hallucinations-are-bad-what-labeling-things-is-easy-f8808e80f0d4 | |||
| 15:30 | The 5%: The Cognitive Architecture AI Was Built For https://medium.com/@cognitivedriftaj/the-5-the-cognitive-architecture-ai-was-built-for-e589888ce6c4 | |||
| 15:27 | TOON: The Token-Oriented Object Notation https://medium.com/@sausi/toon-the-token-oriented-object-notation-05af087d99f2 | |||
| 15:19 | Transformers Pack 175B Parameters: Why AI Explodes in Power https://medium.com/activated-thinker/transformers-pack-175b-parameters-why-ai-explodes-in-power-fab972d0a385 | |||
| 15:14 | Microsoft, Nvidia and Anthropic Announce Strategic Partnerships https://blogs.nvidia.com/blog/microsoft-nvidia-anthropic-announce-partnership/ | |||
| 15:14 | Fixing Sparse Retrieval with RAPTOR on Azure AI Search https://medium.com/microsoftazure/fixing-sparse-retrieval-with-raptor-on-azure-ai-search-4d540dd3bd43 | |||
| 15:13 | Microsoft, Nvidia and Anthropic announce strategic partnerships https://www.anthropic.com/news/microsoft-nvidia-anthropic-announce-strategic-partnerships | |||
| 15:07 | How to Build Production-ready LLM Apps with Langchain? https://medium.com/@byanalytixlabs/how-to-build-production-ready-llm-apps-with-langchain-204a551173b0 | |||
| 15:06 | AI Guru Andrej Karpathy: “Everyone should learn physics early — it’s the best way to kick-start… https://medium.com/@breezen100/ai-guru-andrej-karpathy-everyone-should-learn-physics-early-its-the-best-way-to-kick-start-ffaca0874cef | |||
| 15:03 | Microsoft, Nvidia and Anthropic announce strategic partnerships https://blogs.microsoft.com/blog/2025/11/18/microsoft-nvidia-and-anthropic-announce-strategic-partnerships/ | |||
| 15:03 | TAI #179: Are We in an AI Bubble? How We Will Fund the AI Buildout. https://pub.towardsai.net/tai-179-are-we-in-an-ai-bubble-how-we-will-fund-the-ai-buildout-2eea7208437a | |||
| 14:32 | AI Co-Developer https://medium.com/@kaushalsinh73/ai-co-developer-27307a6cc684 | |||
| 14:05 | OpenAI Customer Service AI Agent https://cobusgreyling.medium.com/openai-customer-service-ai-agent-35113e84e6e6 | |||
| 13:14 | Cloudflare is down – live updates on internet outage affecting ChatGPT, X https://www.tomsguide.com/news/live/cloudfare-outage-november-2025-x-chatgpt | |||
| 13:08 | ChatGPT Is Down https://status.openai.com/history | |||
| 13:08 | The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling https://huggingface.co/blog/hugging-science/eve-bio-mapping-the-pharmone-drug-interaction | |||
| 12:55 | Run Language Model locally using CLI https://medium.com/@kandaanusha/run-language-model-locally-using-cli-13ee12d861f7 | |||
| 12:48 | LLM Routing 101: The Missing Layer in Your AI Architecture for Maximum Performance at Lower Costs https://medium.com/@wikit-tech/llm-routing-101-the-missing-layer-in-your-ai-architecture-for-maximum-performance-at-lower-costs-ebe5b8949d83 | |||
| 12:16 | Prompt Engineering: TOON vs Traditional Prompting https://j4nt4ncrypto.medium.com/prompt-engineering-toon-vs-traditional-prompting-a66268930103 | |||
| 12:14 | LLMScanPro — LLM models vulnerability scanner https://medium.com/@deepanshu_khanna/llmscanpro-llm-models-vulnerability-scanner-c4b584d4ef0f | |||
| 12:14 | Free LLM (Deepseek, Kimi-K2-Thinking, Qwen3, GLM-4.6) via iFlow CLI coding agent https://medium.com/@ttio2tech_28094/free-llm-deepseek-kimi-k2-thinking-qwen3-glm-4-6-via-iflow-cli-coding-agent-a3b7a7a21c78 | |||
| 12:11 | The Monday Refresh https://medium.com/@Sparksinthedark/the-monday-refresh-aadc1d948b14 | |||
| 12:05 | Tokenization Importance https://medium.com/@kandaanusha/tokenization-importance-ad110d539d47 | |||
| 12:02 | Context Is the New Data: Why Banks Need Context https://medium.com/@vaidyasantosh/context-is-the-new-data-why-banks-need-context-847742c2bf3f | |||
| 11:53 | My Experience Comparing LLM Models: When Honesty Matters More Than Agreeability https://anumadhyani.medium.com/my-experience-comparing-llm-models-when-honesty-matters-more-than-agreeability-ac83f1bd57b9 | |||
| 11:44 | Batched Self-Consistency Improves LLM Relevance Assessment and Ranking https://medium.com/tr-labs-ml-engineering-blog/batched-self-consistency-improves-llm-relevance-assessment-and-ranking-54713295f58f | |||
| 11:21 | Toolteeno.com: Simple Developer Tools to Fix My Own Workflow Headaches https://medium.com/@ma3ahmed/toolteeno-com-simple-developer-tools-to-fix-my-own-workflow-headaches-95e99b06f4c7 | |||
| 11:21 | Who Is To Blame? https://cryptosamadhi.medium.com/who-is-to-blame-6b1d151630e8 | |||
| 11:01 | Bye-Bye JSON Overload! A Small Tool I Built to Solve a Big LLM Problem. https://medium.com/@ma3ahmed/bye-bye-json-overload-a-small-tool-i-built-to-solve-a-big-llm-problem-5547f128f2d7 | |||
| 11:00 | Multilingual AI and Its Impact in India https://medium.com/@pr_85211/multilingual-ai-and-its-impact-in-india-ea3057a9005a | |||
| 10:54 | Mastering Continual Pretraining: How to Transform Generalist LLMs into Domain Experts https://ai.plainenglish.io/mastering-continual-pretraining-how-to-transform-generalist-llms-into-domain-experts-12ecb2538b9c | |||
| 10:48 | Agent Evals are hard. What building 300+ AI Agents taught me https://medium.com/@theyashwanthsai/agent-evals-are-hard-what-building-300-ai-agents-taught-me-b8afebe8d4a4 | |||
| 10:44 | Why LLMs Are Not Your Friend: The Structural Failures That Make Verification Mandatory https://medium.com/@tim_62250/why-llms-are-not-your-friend-the-structural-failures-that-make-verification-mandatory-aa5bbd7d6069 | |||
| 10:11 | Write Prompts Like an AI Engineer https://rizwanhoda.medium.com/write-prompts-like-an-ai-engineer-556b00bdb3f7 | |||
| 10:04 | Exploring AI Agent Memory: Short-Term Memory https://medium.com/@rise2semi/exploring-ai-agent-memory-short-term-memory-10d4f543de96 | |||
| 09:51 | Leaked documents shed light into how much OpenAI pays Microsoft https://techcrunch.com/2025/11/14/leaked-documents-shed-light-into-how-much-openai-pays-microsoft/ | |||
| 09:51 | Unlocking legal documents with Small Language Models: Named Entity Recognition powered by Granite 4 https://medium.com/@schneider_36827/unlocking-legal-documents-with-small-language-models-named-entity-recognition-powered-by-granite-4-1c5f41c8375d | |||
| 09:42 | From FMEA Tables to Bowtie Diagrams: How LLMs Are Changing Failure Analysis https://medium.com/@ureason/from-fmea-tables-to-bowtie-diagrams-how-llms-are-changing-failure-analysis-62ed34682acd | |||
| 09:26 | I Built 10 LLM Apps in 30 Days: Here's What Actually Worked (With Code) [Fix real cost table] https://medium.com/@johirbuet/i-built-10-llm-apps-in-30-days-heres-what-actually-worked-with-code-fix-real-cost-table-5e595933bb13 | |||
| 09:10 | Composable Cognitive Architectures: How Modular RAG + Local LLMs Are Reinventing Agentic AI https://medium.com/@servifyspheresolutions/composable-cognitive-architectures-how-modular-rag-local-llms-are-reinventing-agentic-ai-93b2250a0f0a | |||
| 09:09 | TiDAR: The Hybrid AI That Thinks in Parallel and Talks in Sequence, Crushing LLM Latency https://towardsdev.com/tidar-the-hybrid-ai-that-thinks-in-parallel-and-talks-in-sequence-crushing-llm-latency-9a1ef1c5af24 | |||
| 08:41 | Scaling Responsible AI in Africa: Innovation, Fundraising, Risk, and Governance https://medium.com/@enochbayode/scaling-responsible-ai-in-africa-innovation-fundraising-risk-and-governance-fd1877b775c5 | |||
| 08:36 | RAG vs Finetuning: Choosing the Right Approach for Your LLM Application https://canartuc.medium.com/rag-vs-finetuning-choosing-the-right-approach-for-your-llm-application-f068c5b4e7f9 | |||
| 08:27 | Securing AI in Financial Services: The Guardrails Every CTO Must Build https://medium.com/@shanksn.75/securing-ai-in-financial-services-the-guardrails-every-cto-must-build-5c5536492a7e | |||
| 08:15 | Building an AI-Powered Chatbot with Huawei Cloud and Large Language Models https://medium.com/@r95017405/building-an-ai-powered-chatbot-with-huawei-cloud-and-large-language-models-822109dd1b65 | |||
| 08:02 | You build an Agent, it works in test, then fails spectacularly in production. But WHY? https://levelup.gitconnected.com/you-build-an-agent-it-works-in-test-then-fails-spectacularly-in-production-but-why-2c95780dbc33 | |||
| 07:59 | How We Reduced Our API Token Usage by Fifty Percent Using TOON https://medium.com/@scalevise/toon-format-llm-implementation-55af0b78a8db | |||
| 07:59 | How We Reduced Our API Token Usage by Fifty Percent Using TOON https://aws.plainenglish.io/toon-format-llm-implementation-55af0b78a8db | |||
| 07:42 | AI is a new computing paradigm – Karpathy https://threadreaderapp.com/thread/1990116666194456651.html | |||
| 07:27 | Understanding Cache, LMCache & Why It Accelerates LLM Inference https://dineshr1493.medium.com/understanding-cache-lmcache-why-it-accelerates-llm-inference-2606cda43677 | |||
| 07:10 | Meta’s AI Voice Passed My Blind Human Test https://medium.com/coding-nexus/metas-ai-voice-passed-my-blind-human-test-4f1afed9c712 | |||
| 07:08 | Build Your First AI App with OpenAI API and Python (No Experience Needed) https://medium.com/@dharamai2024/build-your-first-ai-app-with-openai-api-and-python-no-experience-needed-77549b1e884c | |||
| 07:06 | WardWise — Building an AI Assistant for Hospital Ward Rounds on Cloud Run https://medium.com/@shashwatpattnayak2001/wardwise-building-an-ai-assistant-for-hospital-ward-rounds-on-cloud-run-3dc4a93b371d | |||
| 07:05 | Controlling User Queries in a Stateless LLM Environment https://srinadhch07.medium.com/controlling-user-queries-in-a-stateless-llm-environment-7dc8635bfead | |||
| 06:57 | Finding the Edge of the Spark DGX https://rossingram.medium.com/finding-the-edge-of-the-spark-dgx-9fe2bfb23dee | |||
| 06:39 | The Secret to Better AI Responses : Google Prompting Essentials https://medium.com/@abhiruchipatil31/the-secret-to-better-ai-responses-google-prompting-essentials-335372a377ba | |||
| 06:38 | TOON, New datatype JSON for LLMs https://medium.com/@jojojoseph11/toon-new-datatype-json-for-llms-9ca658da1d2f | |||
| 06:37 | Generative AI 101: A Journey from GANs to Large Language Models https://medium.com/@mmehmetisik/generative-ai-101-ganlardan-b%C3%BCy%C3%BCk-dil-modellerine-yolculuk-9af3b010aa0b | |||
| 06:35 | Is Your AI Agent Drowning in Tokens? There’s a Lifeline! https://medium.com/@agarwalnavneet23/is-your-ai-agent-drowning-in-tokens-theres-a-lifeline-5f4c1cd163b7 | |||
| 06:24 | Getting Started with VLLM — Installation, Setup & Inference (Online & Air-Gapped) https://dineshr1493.medium.com/getting-started-with-vllm-installation-setup-inference-online-air-gapped-5522fed5fbd9 | |||
| 05:28 | Show HN: I built a dumb Reddit simulator using LLMs https://app.llmxllm.com | |||
| 04:48 | Machine Unlearning: Why Forgetting is the New Superpower of AI https://medium.com/@harshaldharpure/machine-unlearning-why-forgetting-is-the-new-superpower-of-ai-0a3999ae01f5 | |||
| 04:45 | Top 5 Udemy Courses for AI Engineering Interviews in 2026 https://medium.com/javarevisited/top-5-udemy-courses-for-ai-engineering-interviews-in-2026-550fff7214c3 | |||
| 04:32 | 10 SLM Use Cases That Beat LLMs on Cost https://medium.com/@Modexa/10-slm-use-cases-that-beat-llms-on-cost-7e2fa0acd361 | |||
| 04:12 | TOON Format: The 40% Token Savings That Still Can’t Dethrone JSON https://tasmayshah12.medium.com/toon-format-the-40-token-savings-that-still-cant-dethrone-json-b220a9dd4eaa | |||
| 04:02 | Building an Enterprise-Grade RAG Pipeline, Part 1: Architecture Foundations and Data Flow https://medium.com/@goyalharshal916/building-an-enterprise-grade-rag-pipeline-part-1-architecture-foundations-and-data-flow-ffa17308b33c | |||
| 04:02 | No — ChatGPT Isn’t “Obsolete.” The Real Issue Is Architectural, Not Apocalyptic. https://ophi06.medium.com/no-chatgpt-isnt-obsolete-the-real-issue-is-architectural-not-apocalyptic-4aeaa2384291 | |||
| 03:58 | Regularisation https://medium.com/large-language-model-probability-and-common-sense/regularisation-3c980d498e87 | |||
| 03:57 | Inside Kimi K2 Thinking: The Technical Breakthroughs Nobody’s Talking About https://medium.com/@sa.aghadavood/inside-kimi-k2-thinking-the-technical-breakthroughs-nobodys-talking-about-30b03d22b4c0 | |||
| 03:55 | Small But Furious: How Compact AI Models Stole the Show https://medium.com/@rogt.x1997/small-but-furious-how-compact-ai-models-stole-the-show-c95727e71e00 | |||
| 03:38 | All Quiet on the Agent Front: A Glimpse of Modern Warfare from the Claude Incident https://medium.com/@calen0909/all-quiet-on-the-agent-front-a-glimpse-of-modern-warfare-from-the-claude-incident-a175b48f9df3 | |||
| 03:22 | Running Llama 4 on GKE with vLLM https://medium.com/coding-nexus/running-llama-4-on-gke-with-vllm-7ced9727b54c | |||
| 03:07 | One Big Beautiful Agent — Integrating LangGraph, CrewAI, and Agno — Using CopilotKit https://medium.com/coding-nexus/one-big-beautiful-agent-integrating-langgraph-crewai-and-agno-using-copilotkit-454dff8edf76 | |||
| 03:04 | OmniDaemon: The Event-Driven Runtime Built to Scale Manager–Sub-Agent AI Systems https://medium.com/coding-nexus/omnidaemon-the-event-driven-runtime-built-to-scale-manager-sub-agent-ai-systems-7210e9a138df | |||
| 03:00 | OpenAI is piloting group conversations in ChatGPT https://www.engadget.com/ai/openai-is-piloting-group-conversations-in-chatgpt-053255102.html | |||
| 02:47 | InQuest: Building a Full Retrieval Augmented Chatbot https://medium.com/@jlsonon12/inquest-building-a-full-retrieval-augmented-chatbot-5f893c5e8e08 | |||
| 02:36 | Top Picks for the Best LLMs for Coding in 2025: A Developer’s Choice https://medium.com/@brendan.bohan/top-picks-for-the-best-llms-for-coding-in-2025-a-developers-choice-94178057ac7a | |||
| 02:09 | TOON: The Lightweight JSON Replacement for LLMs (Reduce LLM Token Costs by up to 60%) https://medium.com/coding-nexus/toon-the-lightweight-json-replacement-for-llms-reduce-llm-token-costs-by-up-to-60-ece629c84821 | |||
| 02:04 | How to Run Local LLMs with Docker https://medium.com/coding-nexus/how-to-run-local-llms-with-docker-7f0ca6c35017 | |||
| 01:54 | Why Padding is Crucial in NLP: A Beginner’s Guide https://learningmindquest.medium.com/why-padding-is-crucial-in-nlp-a-beginners-what-is-pad-in-training-model-85950d38d69c | |||
| 01:19 | Title: Software That Starts Small — And Grows (Yes, itself) https://medium.com/@roeibaraviv/title-software-that-starts-small-and-grows-yes-itself-17704dec9536 | |||