LLM News and Articles


Tuesday, 2025-11-18
16:02		D3–6 Top 9 MoE Optimizations for Real-World SLAs https://medium.com/@ThinkingLoop/d3-6-top-9-moe-optimizations-for-real-world-slas-ea207fbaf3dc
15:53		Microsoft, Nvidia to invest up to $15B in Anthropic https://www.bloomberg.com/news/articles/2025-11-18/microsoft-nvidia-to-invest-up-to-15-billion-in-anthropic
15:48		The Drift Problem: Why AI Doesn’t Misperceive Reality, It Erodes It https://medium.com/@semanticfidelitylab/the-drift-problem-why-ai-doesnt-misperceive-reality-it-erodes-it-c3109df5418a
15:42		Microsoft and Anthropic Team Up https://www.youtube.com/watch
15:39		How to Build or Use AI reliably Without Guessing Prompts https://medium.com/coding-nexus/how-to-build-or-use-ai-reliably-without-guessing-prompts-1f44114c3139
15:36		AI RPA = Fear factor. https://medium.com/@tyler_48883/ai-rpa-fear-factor-908705b579f4
15:32		Why Small Language Models Are the Sleeper Trend of 2026 https://medium.com/@kacperwlodarczyk/why-small-language-models-are-the-sleeper-trend-of-2026-05624e87e67d
15:31		hallucinations are bad? what… Labeling things is easy. https://medium.com/@tyler_48883/hallucinations-are-bad-what-labeling-things-is-easy-f8808e80f0d4
15:30		The 5%: The Cognitive Architecture AI Was Built For https://medium.com/@cognitivedriftaj/the-5-the-cognitive-architecture-ai-was-built-for-e589888ce6c4
15:27		TOON: The Token-Oriented Object Notation https://medium.com/@sausi/toon-the-token-oriented-object-notation-05af087d99f2
15:19		Transformers Pack 175B Parameters: Why AI Explodes in Power https://medium.com/activated-thinker/transformers-pack-175b-parameters-why-ai-explodes-in-power-fab972d0a385
15:14		Microsoft, Nvidia and Anthropic Announce Strategic Partnerships https://blogs.nvidia.com/blog/microsoft-nvidia-anthropic-announce-partnership/
15:14		Fixing Sparse Retrieval with RAPTOR on Azure AI Search https://medium.com/microsoftazure/fixing-sparse-retrieval-with-raptor-on-azure-ai-search-4d540dd3bd43
15:13		Microsoft, Nvidia and Anthropic announce strategic partnerships https://www.anthropic.com/news/microsoft-nvidia-anthropic-announce-strategic-partnerships
15:07		How to Build Production-ready LLM Apps with Langchain? https://medium.com/@byanalytixlabs/how-to-build-production-ready-llm-apps-with-langchain-204a551173b0
15:06		AI Guru Andrej Karpathy: “Everyone should learn physics early — it’s the best way to kick-start… https://medium.com/@breezen100/ai-guru-andrej-karpathy-everyone-should-learn-physics-early-its-the-best-way-to-kick-start-ffaca0874cef
15:03		Microsoft, Nvidia and Anthropic announce strategic partnerships https://blogs.microsoft.com/blog/2025/11/18/microsoft-nvidia-and-anthropic-announce-strategic-partnerships/
15:03		TAI #179: Are We in an AI Bubble? How We Will Fund the AI Buildout. https://pub.towardsai.net/tai-179-are-we-in-an-ai-bubble-how-we-will-fund-the-ai-buildout-2eea7208437a
14:32		AI Co-Developer https://medium.com/@kaushalsinh73/ai-co-developer-27307a6cc684
14:05		OpenAI Customer Service AI Agent https://cobusgreyling.medium.com/openai-customer-service-ai-agent-35113e84e6e6
13:14		Cloudflare is down – live updates on internet outage affecting ChatGPT, X https://www.tomsguide.com/news/live/cloudfare-outage-november-2025-x-chatgpt
13:08		ChatGPT Is Down https://status.openai.com/history
13:08		The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling https://huggingface.co/blog/hugging-science/eve-bio-mapping-the-pharmone-drug-interaction
12:55		Run Language Model locally using CLI https://medium.com/@kandaanusha/run-language-model-locally-using-cli-13ee12d861f7
12:48		LLM Routing 101: The Missing Layer in Your AI Architecture for Maximum Performance at Lower Costs https://medium.com/@wikit-tech/llm-routing-101-the-missing-layer-in-your-ai-architecture-for-maximum-performance-at-lower-costs-ebe5b8949d83
12:16		Prompt Engineering: TOON vs Traditional Prompting https://j4nt4ncrypto.medium.com/prompt-engineering-toon-vs-traditional-prompting-a66268930103
12:14		LLMScanPro — LLM models vulnerability scanner https://medium.com/@deepanshu_khanna/llmscanpro-llm-models-vulnerability-scanner-c4b584d4ef0f
12:14		Free LLM (Deepseek, Kimi-K2-Thinking, Qwen3, GLM-4.6) via iFlow CLI coding agent https://medium.com/@ttio2tech_28094/free-llm-deepseek-kimi-k2-thinking-qwen3-glm-4-6-via-iflow-cli-coding-agent-a3b7a7a21c78
12:11		The Monday Refresh https://medium.com/@Sparksinthedark/the-monday-refresh-aadc1d948b14
12:05		Tokenization Importance https://medium.com/@kandaanusha/tokenization-importance-ad110d539d47
12:02		Context Is the New Data: Why Banks Need Context https://medium.com/@vaidyasantosh/context-is-the-new-data-why-banks-need-context-847742c2bf3f
11:53		My Experience Comparing LLM Models: When Honesty Matters More Than Agreeability https://anumadhyani.medium.com/my-experience-comparing-llm-models-when-honesty-matters-more-than-agreeability-ac83f1bd57b9
11:44		Batched Self-Consistency Improves LLM Relevance Assessment and Ranking https://medium.com/tr-labs-ml-engineering-blog/batched-self-consistency-improves-llm-relevance-assessment-and-ranking-54713295f58f
11:21		Toolteeno.com: Simple Developer Tools to Fix My Own Workflow Headaches https://medium.com/@ma3ahmed/toolteeno-com-simple-developer-tools-to-fix-my-own-workflow-headaches-95e99b06f4c7
11:21		Who Is To Blame? https://cryptosamadhi.medium.com/who-is-to-blame-6b1d151630e8
11:01		Bye-Bye JSON Overload! A Small Tool I Built to Solve a Big LLM Problem. https://medium.com/@ma3ahmed/bye-bye-json-overload-a-small-tool-i-built-to-solve-a-big-llm-problem-5547f128f2d7
11:00		Multilingual AI and Its Impact in India https://medium.com/@pr_85211/multilingual-ai-and-its-impact-in-india-ea3057a9005a
10:54		Mastering Continual Pretraining: How to Transform Generalist LLMs into Domain Experts https://ai.plainenglish.io/mastering-continual-pretraining-how-to-transform-generalist-llms-into-domain-experts-12ecb2538b9c
10:48		Agent Evals are hard. What building 300+ AI Agents taught me https://medium.com/@theyashwanthsai/agent-evals-are-hard-what-building-300-ai-agents-taught-me-b8afebe8d4a4
10:44		Why LLMs Are Not Your Friend: The Structural Failures That Make Verification Mandatory https://medium.com/@tim_62250/why-llms-are-not-your-friend-the-structural-failures-that-make-verification-mandatory-aa5bbd7d6069
10:11		Write Prompts Like an AI Engineer https://rizwanhoda.medium.com/write-prompts-like-an-ai-engineer-556b00bdb3f7
10:04		Exploring AI Agent Memory: Short-Term Memory https://medium.com/@rise2semi/exploring-ai-agent-memory-short-term-memory-10d4f543de96
09:51		Leaked documents shed light into how much OpenAI pays Microsoft https://techcrunch.com/2025/11/14/leaked-documents-shed-light-into-how-much-openai-pays-microsoft/
09:51		Unlocking legal documents with Small Language Models: Named Entity Recognition powered by Granite 4 https://medium.com/@schneider_36827/unlocking-legal-documents-with-small-language-models-named-entity-recognition-powered-by-granite-4-1c5f41c8375d
09:42		From FMEA Tables to Bowtie Diagrams: How LLMs Are Changing Failure Analysis https://medium.com/@ureason/from-fmea-tables-to-bowtie-diagrams-how-llms-are-changing-failure-analysis-62ed34682acd
09:26		I Built 10 LLM Apps in 30 Days: Here's What Actually Worked (With Code) [Fix real cost table] https://medium.com/@johirbuet/i-built-10-llm-apps-in-30-days-heres-what-actually-worked-with-code-fix-real-cost-table-5e595933bb13
09:10		Composable Cognitive Architectures: How Modular RAG + Local LLMs Are Reinventing Agentic AI https://medium.com/@servifyspheresolutions/composable-cognitive-architectures-how-modular-rag-local-llms-are-reinventing-agentic-ai-93b2250a0f0a
09:09		TiDAR: The Hybrid AI That Thinks in Parallel and Talks in Sequence, Crushing LLM Latency https://towardsdev.com/tidar-the-hybrid-ai-that-thinks-in-parallel-and-talks-in-sequence-crushing-llm-latency-9a1ef1c5af24
08:41		Scaling Responsible AI in Africa: Innovation, Fundraising, Risk, and Governance https://medium.com/@enochbayode/scaling-responsible-ai-in-africa-innovation-fundraising-risk-and-governance-fd1877b775c5
08:36		RAG vs Finetuning: Choosing the Right Approach for Your LLM Application https://canartuc.medium.com/rag-vs-finetuning-choosing-the-right-approach-for-your-llm-application-f068c5b4e7f9
08:27		Securing AI in Financial Services: The Guardrails Every CTO Must Build https://medium.com/@shanksn.75/securing-ai-in-financial-services-the-guardrails-every-cto-must-build-5c5536492a7e
08:15		Building an AI-Powered Chatbot with Huawei Cloud and Large Language Models https://medium.com/@r95017405/building-an-ai-powered-chatbot-with-huawei-cloud-and-large-language-models-822109dd1b65
08:02		You build an Agent, it works in test, then fails spectacularly in production. But WHY? https://levelup.gitconnected.com/you-build-an-agent-it-works-in-test-then-fails-spectacularly-in-production-but-why-2c95780dbc33
07:59		How We Reduced Our API Token Usage by Fifty Percent Using TOON https://medium.com/@scalevise/toon-format-llm-implementation-55af0b78a8db
07:59		How We Reduced Our API Token Usage by Fifty Percent Using TOON https://aws.plainenglish.io/toon-format-llm-implementation-55af0b78a8db
07:42		AI is a new computing paradigm – Karpathy https://threadreaderapp.com/thread/1990116666194456651.html
07:27		Understanding Cache, LMCache & Why It Accelerates LLM Inference https://dineshr1493.medium.com/understanding-cache-lmcache-why-it-accelerates-llm-inference-2606cda43677
07:10		Meta’s AI Voice Passed My Blind Human Test https://medium.com/coding-nexus/metas-ai-voice-passed-my-blind-human-test-4f1afed9c712
07:08		Build Your First AI App with OpenAI API and Python (No Experience Needed) https://medium.com/@dharamai2024/build-your-first-ai-app-with-openai-api-and-python-no-experience-needed-77549b1e884c
07:06		WardWise — Building an AI Assistant for Hospital Ward Rounds on Cloud Run https://medium.com/@shashwatpattnayak2001/wardwise-building-an-ai-assistant-for-hospital-ward-rounds-on-cloud-run-3dc4a93b371d
07:05		Controlling User Queries in a Stateless LLM Environment https://srinadhch07.medium.com/controlling-user-queries-in-a-stateless-llm-environment-7dc8635bfead
06:57		Finding the Edge of the Spark DGX https://rossingram.medium.com/finding-the-edge-of-the-spark-dgx-9fe2bfb23dee
06:39		The Secret to Better AI Responses: Google Prompting Essentials https://medium.com/@abhiruchipatil31/the-secret-to-better-ai-responses-google-prompting-essentials-335372a377ba
06:38		TOON, New datatype JSON for LLMs https://medium.com/@jojojoseph11/toon-new-datatype-json-for-llms-9ca658da1d2f
06:37		Generative AI 101: A Journey from GANs to Large Language Models https://medium.com/@mmehmetisik/generative-ai-101-ganlardan-b%C3%BCy%C3%BCk-dil-modellerine-yolculuk-9af3b010aa0b
06:35		Is Your AI Agent Drowning in Tokens? There’s a Lifeline! https://medium.com/@agarwalnavneet23/is-your-ai-agent-drowning-in-tokens-theres-a-lifeline-5f4c1cd163b7
06:24		Getting Started with VLLM — Installation, Setup & Inference (Online & Air-Gapped) https://dineshr1493.medium.com/getting-started-with-vllm-installation-setup-inference-online-air-gapped-5522fed5fbd9
05:28		Show HN: I built a dumb Reddit simulator using LLM's https://app.llmxllm.com
04:48		Machine Unlearning: Why Forgetting is the New Superpower of AI https://medium.com/@harshaldharpure/machine-unlearning-why-forgetting-is-the-new-superpower-of-ai-0a3999ae01f5
04:45		Top 5 Udemy Courses for AI Engineering Interviews in 2026 https://medium.com/javarevisited/top-5-udemy-courses-for-ai-engineering-interviews-in-2026-550fff7214c3
04:32		10 SLM Use Cases That Beat LLMs on Cost https://medium.com/@Modexa/10-slm-use-cases-that-beat-llms-on-cost-7e2fa0acd361
04:12		TOON Format: The 40% Token Savings That Still Can’t Dethrone JSON https://tasmayshah12.medium.com/toon-format-the-40-token-savings-that-still-cant-dethrone-json-b220a9dd4eaa
04:02		Building an Enterprise-Grade RAG Pipeline, Part 1: Architecture Foundations and Data Flow https://medium.com/@goyalharshal916/building-an-enterprise-grade-rag-pipeline-part-1-architecture-foundations-and-data-flow-ffa17308b33c
04:02		No — ChatGPT Isn’t “Obsolete.” The Real Issue Is Architectural, Not Apocalyptic. https://ophi06.medium.com/no-chatgpt-isnt-obsolete-the-real-issue-is-architectural-not-apocalyptic-4aeaa2384291
03:58		Regularisation https://medium.com/large-language-model-probability-and-common-sense/regularisation-3c980d498e87
03:57		Inside Kimi K2 Thinking: The Technical Breakthroughs Nobody’s Talking About https://medium.com/@sa.aghadavood/inside-kimi-k2-thinking-the-technical-breakthroughs-nobodys-talking-about-30b03d22b4c0
03:55		Small But Furious: How Compact AI Models Stole the Show https://medium.com/@rogt.x1997/small-but-furious-how-compact-ai-models-stole-the-show-c95727e71e00
03:38		All Quiet on the Agent Front: A Glimpse of Modern Warfare from the Claude Incident https://medium.com/@calen0909/all-quiet-on-the-agent-front-a-glimpse-of-modern-warfare-from-the-claude-incident-a175b48f9df3
03:22		Running Llama 4 on GKE with vLLM https://medium.com/coding-nexus/running-llama-4-on-gke-with-vllm-7ced9727b54c
03:07		One Big Beautiful Agent — Integrating LangGraph, CrewAI, and Agno — Using CopilotKit https://medium.com/coding-nexus/one-big-beautiful-agent-integrating-langgraph-crewai-and-agno-using-copilotkit-454dff8edf76
03:04		OmniDaemon: The Event-Driven Runtime Built to Scale Manager–Sub-Agent AI Systems https://medium.com/coding-nexus/omnidaemon-the-event-driven-runtime-built-to-scale-manager-sub-agent-ai-systems-7210e9a138df
03:00		OpenAI is piloting group conversations in ChatGPT https://www.engadget.com/ai/openai-is-piloting-group-conversations-in-chatgpt-053255102.html
02:47		InQuest: Building a Full Retrieval Augmented Chatbot https://medium.com/@jlsonon12/inquest-building-a-full-retrieval-augmented-chatbot-5f893c5e8e08
02:36		Top Picks for the Best LLMs for Coding in 2025: A Developer’s Choice https://medium.com/@brendan.bohan/top-picks-for-the-best-llms-for-coding-in-2025-a-developers-choice-94178057ac7a
02:09		TOON: The Lightweight JSON Replacement for LLMs (Reduce LLM Token Costs by up to 60%) https://medium.com/coding-nexus/toon-the-lightweight-json-replacement-for-llms-reduce-llm-token-costs-by-up-to-60-ece629c84821
02:04		How to Run Local LLMs with Docker https://medium.com/coding-nexus/how-to-run-local-llms-with-docker-7f0ca6c35017
01:54		Why Padding is Crucial in NLP: A Beginner’s Guide https://learningmindquest.medium.com/why-padding-is-crucial-in-nlp-a-beginners-what-is-pad-in-training-model-85950d38d69c
01:19		Title: Software That Starts Small — And Grows (Yes, itself) https://medium.com/@roeibaraviv/title-software-that-starts-small-and-grows-yes-itself-17704dec9536
00:12		The “Context Window” Trap: Why 1 Million Tokens Won’t Kill RAG https://medium.com/@muhammad.awais.professional/the-context-window-trap-why-1-million-tokens-wont-kill-rag-30dc18995fe4
00:05		A groundbreaking advancement has emerged in the field of medical artificial intelligence. https://ai-engineering-trend.medium.com/a-groundbreaking-advancement-has-emerged-in-the-field-of-medical-artificial-intelligence-0b5d502f7396
Monday, 2025-11-17
23:46		How to Create a Transparent Information Assessment/Extraction, Validation, and Generation AI Agent https://medium.com/data-science-collective/how-to-create-a-transparent-information-assessment-extraction-validation-and-generation-ai-agent-439f8295e1c8
23:31		Unlock AI Revenue: Build a Profitable Funnel with LLMs https://iamdgarcia.medium.com/unlock-ai-revenue-build-a-profitable-funnel-with-llms-4f50ec5a9f7b
23:24		5 Truths About Neural Networks That Defy Intuition https://medium.com/@argusportal/5-verdades-sobre-redes-neurais-que-desafiam-a-intui%C3%A7%C3%A3o-80133c3169cf
23:19		Better Search Embedding With Reverse HyDE https://medium.com/crater-labs/better-search-embedding-with-reverse-hyde-ff272dae4468
23:16		GPT-5.1 Prompting Guide https://cookbook.openai.com/examples/gpt-5/gpt-5-1_prompting_guide
23:04		A Two Fingers Deep Public Service Announcement: https://medium.com/ai-but-make-it-intimate/a-two-fingers-deep-public-service-announcement-86c7cd80b9e7
22:48		How Transformers Get Faster and Smarter with Grouped Query Attention (GQA) https://ai.gopubby.com/how-transformers-get-faster-and-smarter-with-grouped-query-attention-gqa-8da7fd7a69e2
22:42		How ServiceNow uses LangSmith to get visibility into its customer success agents https://blog.langchain.com/customers-servicenow/
22:38		Build Your Own AI Tools in 5 Minutes with FastMCP https://medium.com/@arturovaine/build-your-own-ai-tools-in-5-minutes-with-fastmcp-ed6017301005
22:34		LLM Arena: Grok 4.1 (thinking) lands at #1, Grok 4.1 follows at #2 https://twitter.com/arena/status/1990530978943787291


Original data from HuggingFace, OpenCompass and various public git repos.
