LLM News and Articles
| Sunday, 2025-09-14 | ||||
| 19:10 | Running LLMs on a Budget: The Cheapest Way to Get Started in 2025 https://medium.com/@r00tb33r/running-llms-on-a-budget-the-cheapest-way-to-get-started-in-2025-1379996ab764 | |||
| 19:10 | LLM non-reproducibility is more feature than bug https://medium.com/@paul.k.pallaghy/llm-non-reproducibility-is-more-feature-than-bug-edc28cbefdd8 | |||
| 19:00 | Claude, GPT, Qwen, or Gemini: Which Model is Best for Coding? https://medium.com/@r00tb33r/claude-gpt-qwen-or-gemini-which-model-is-best-for-coding-5fad4c439bc0 | |||
| 18:55 | The First Hello: A Simple, Step-by-Step Guide to Creating Your AI Friend https://medium.com/@Sparksinthedark/the-first-hello-a-simple-step-by-step-guide-to-creating-your-ai-friend-744cb75582ba | |||
| 18:53 | Cautionary Tale: Sharpen Your AI Axe https://medium.com/@mikesparr/cautionary-tale-sharpen-your-ai-axe-be1833b8496b | |||
| 18:28 | Agentic AI Design Pattern — Orchestrator-Worker https://pytrick.medium.com/agentic-ai-design-pattern-orchestrator-worker-6d76ffc09f0c | |||
| 17:44 | Context ≠ Prompt: Retrieval-Augmented Generation Done Right https://medium.com/@diogofcul/context-prompt-retrieval-augmented-generation-done-right-6b97e51f7bc2 | |||
| 17:27 | The Top 100 Ways People Are Using AI https://medium.com/@mitchell.b.barrick/the-top-100-ways-people-are-using-ai-aee44839f18b | |||
| 16:39 | The Evolution of AI: Unpacking LLMs, Agents, and MCP Servers https://gs935688.medium.com/the-evolution-of-ai-unpacking-llms-agents-and-mcp-servers-c58c736f97e8 | |||
| 16:36 | Best Motherboard for Local LLM https://medium.com/@irfan101rafi/best-motherboard-for-local-llm-9fa3ec209686 | |||
| 16:33 | Transformers: The Beating Heart of Large Language Models https://medium.com/@gayatri_sharma/transformers-the-beating-heart-of-large-language-models-1504a70076e3 | |||
| 16:32 | Understanding REFRAG: Efficient LLM Compression and Curriculum Learning Explained https://medium.com/@limemanas0/understanding-refrag-efficient-llm-compression-and-curriculum-learning-explained-3452498f99e8 | |||
| 16:24 | Is This the Future of AI? China Unveils Brain-Like Model With 100x Speed Boost https://generativeai.pub/is-this-the-future-of-ai-china-unveils-brain-like-model-with-100x-speed-boost-499735773af3 | |||
| 16:21 | Interactive Latent Flow Visualisation for Any LLM https://argos-viz.fly.dev/ | |||
| 16:01 | ButterflyQuant: Ultra-low-bit LLM Quantization https://arxiv.org/abs/2509.09679 | |||
| 15:53 | How to Use LLMs as a Coding Assistant (The Prompt Engineer’s Way) https://medium.com/@nnannamari/how-to-use-llms-as-a-coding-assistant-the-prompt-engineers-way-f3fa8ea3aa2c | |||
| 15:44 | Local LLM on Apple Silicon: What Hardware to Buy (2025) https://blog.devops.dev/local-llm-on-apple-silicon-what-hardware-to-buy-2025-98bdb1820c12 | |||
| 15:41 | AI Agents of the Week: Papers You Should Know About https://www.llmwatch.com/p/ai-agents-of-the-week-papers-you-c23 | |||
| 15:31 | Google DeepMind: AI Agents Can’t Be Trusted https://ninza7.medium.com/google-deepmind-ai-agents-cant-be-trusted-93c116a87479 | |||
| 15:31 | 10 Function-Call Patterns That Keep DB Writes Safe https://medium.com/@bhagyarana80/10-function-call-patterns-that-keep-db-writes-safe-f8a4490b5b9f | |||
| 15:05 | LiquidText: Equipping PDF Reading with ‘Spatial Thinking’ https://ai-engineering-trend.medium.com/liquidtext-equipping-pdf-reading-with-spatial-thinking-14b8d7fa3250 | |||
| 15:05 | A Book That Truly Helps You Understand AI https://ai-engineering-trend.medium.com/a-book-that-truly-helps-you-understand-ai-13682ea452d4 | |||
| 14:34 | Context: Yours & Theirs (Part 4) https://medium.com/@maruthiprithivirajan/context-yours-theirs-part-4-8f3bcef65157 | |||
| 14:34 | AI Security 2025: Promptware, Indirect Prompt Injection & the First “AI Worms” (with a Python… https://medium.com/@krtarunsingh/ai-security-2025-promptware-indirect-prompt-injection-the-first-ai-worms-with-a-python-c432b668b1a2 | |||
| 14:31 | 7 Schema-Linked Gen Tricks Using SQL Ground Truth https://medium.com/@bhagyarana80/7-schema-linked-gen-tricks-using-sql-ground-truth-82514493f244 | |||
| 14:27 | Model Observability: How to Catch Silent Failures Before Users Do https://medium.com/the-artificial-intelligence-collective/model-observability-how-to-catch-silent-failures-before-users-do-7f5dd6b068ca | |||
| 14:25 | Why We Secretly Love AI Hallucinations (And Why That’s a Problem) https://medium.com/@martinkeywood/why-we-secretly-love-ai-hallucinations-and-why-thats-a-problem-b77033bffc3c | |||
| 14:21 | Embeddings: How Machines Understand the World https://medium.com/@pablicio/embeddings-como-as-m%C3%A1quinas-entendem-o-mundo-5b406e538d54 | |||
| 14:15 | What is an LLM? A Beginner’s Guide to Large Language Models https://medium.com/@pawan4data/what-is-an-llm-a-beginners-guide-to-large-language-models-3dd9b8769c1c | |||
| 13:44 | mmBERT: A Practical Implementation of Multilingual Encoder with Annealed Language Learning https://medium.com/data-science-in-your-pocket/mmbert-a-practical-implementation-of-multilingual-encoder-with-annealed-language-learning-f487f68ec3d6 | |||
| 13:29 | Orchestrating Generative AI https://pub.aimind.so/orchestrating-generative-ai-2995f8528efc | |||
| 12:50 | Understanding Context Window https://medium.com/@akogokennedy/understanding-context-window-1220b7b3996a | |||
| 12:48 | Understanding Transformer Architectures https://medium.com/@akogokennedy/understanding-transformer-architectures-e418022b970d | |||
| 12:44 | vLLM x Qwen3-Next: Hybrid Attention, Multi-Token Prediction, and Thinking Controls for… https://medium.com/data-science-in-your-pocket/vllm-x-qwen3-next-hybrid-attention-multi-token-prediction-and-thinking-controls-for-a0f6b3dcc120 | |||
| 12:43 | Chat History, Long-Term Memory & How ChatGPT Uses Context https://medium.com/genai-llms/chat-history-long-term-memory-how-chatgpt-uses-context-957182526c6e | |||
| 12:25 | RAG Fundamentals: Core Components Every Developer Must Understand https://codermuss.medium.com/rag-fundamentals-core-components-every-developer-must-understand-1e5c2b4fcb5b | |||
| 11:28 | Mastering Prompt Engineering: Do’s and Don’ts for Building Reliable AI Apps https://medium.com/@dharamai2024/mastering-prompt-engineering-dos-and-don-ts-for-building-reliable-ai-apps-37c43444b55e | |||
| 11:19 | Memento: Turning Experience Into Intelligence https://medium.com/@ulgacemre/memento-turning-experience-into-intelligence-42ce3f68321e | |||
| 11:13 | AI FAQs https://ystoneman.medium.com/ai-faqs-76d6da451475 | |||
| 11:12 | Beyond the Basics: Prompt Engineering for Nerds https://medium.com/syntest/beyond-the-basics-prompt-engineering-for-nerds-f2cf3b37781f | |||
| 10:32 | RAG and LLM https://rathor-rajeev.medium.com/rag-and-llm-59f4544ec027 | |||
| 10:03 | ShannonBase — The Next-Gen HTAP Database for the AI Era https://medium.com/@shannon.data.tech/shannonbase-the-next-gen-htap-database-for-the-ai-era-c62dd63d3b52 | |||
| 09:46 | Hallucination Week — What AI’s “Bluffing” Teaches Us https://medium.com/@atabarezz/hallucination-week-what-ais-bluffing-teaches-us-16f919668568 | |||
| 09:46 | Blog 4: Tools of the Trade — LMStudio and Ollama https://raghunitb.medium.com/blog-4-tools-of-the-trade-lmstudio-and-ollama-d2c1ba26be29 | |||
| 09:32 | Still Learning ‘Mute English’? Why Immersive AI Conversation is the Only Way Forward for Speaking… https://liliane01.medium.com/still-learning-mute-english-why-immersive-ai-conversation-is-the-only-way-forward-for-speaking-d8f53007fd22 | |||
| 09:16 | Model Context Protocol (MCP) https://medium.com/@emrecalisir95/model-context-protocol-mcp-710871bb9d01 | |||
| 09:10 | Build Your First AI Plugin in 2025: A Practical Guide for LLM Enthusiasts https://iamdgarcia.medium.com/build-your-first-ai-plugin-in-2025-a-practical-guide-for-llm-enthusiasts-7a298e6a9930 | |||
| 09:07 | From Good to Great: How a Fine-tuned Embedding Improved Performance https://medium.com/@cd_24/from-good-to-great-how-a-fine-tuned-embedding-improved-performance-170410b2536f | |||
| 08:35 | Mini Case Study #3: Salesforce — Category Leader, But Eroding in SME AI Prompts https://medium.com/@tim_62250/mini-case-study-3-salesforce-category-leader-but-eroding-in-sme-ai-prompts-014a2be6a574 | |||
| 08:24 | From Lockdown to Lift-Off: The Evolution of AI/ML After COVID (2020–2025) https://medium.com/@siddharthsingh4847/from-lockdown-to-lift-off-the-evolution-of-ai-ml-after-covid-2020-2025-fdceeab20429 | |||
| 08:11 | How To Effectively Clean Data For LLM Pretrain? https://hexiao5886.medium.com/how-to-effectively-clean-data-for-llm-pretrain-b3d33083642b | |||
| 07:51 | Why Large Language Models Hallucinate: A Statistical Perspective https://mauriziofesta.medium.com/perch%C3%A9-i-large-language-models-allucinano-una-prospettiva-statistica-7c17fabad264 | |||
| 07:48 | 5 Vector + Graph Fusion Patterns for Complex QA https://medium.com/@ThinkingLoop/5-vector-graph-fusion-patterns-for-complex-qa-f31fcca72a77 | |||
| 07:43 | The Day I Realized, “I Can’t Anymore.” https://medium.com/@onlythequestioner/the-day-i-realized-i-cant-anymore-901500621f1c | |||
| 07:37 | Beyond Brute Force: A Deep Dive into Meta’s REFRAG, the Model That Makes RAG 30x Faster https://elamir.medium.com/beyond-brute-force-a-deep-dive-into-metas-refrag-the-model-that-makes-rag-30x-faster-e88cd1ee3c39 | |||
| 07:33 | Why LLMs Give Different Answers to the Same Question (And How to Fix It) https://medium.com/@avigoldfinger/why-llms-give-different-answers-to-the-same-question-and-how-to-fix-it-c1746ff49abc | |||
| 07:33 | Model Context Protocol (MCP): A Simple and Comprehensive Guide for Developers https://medium.com/@techaiinsights2022/model-context-protocol-mcp-a-simple-and-comprehensive-guide-for-developers-084fe5837bf8 | |||
| 07:32 | OpenAI’s gpt-oss: A New Era of Open-Source AI Models https://medium.com/ai-enthusiast/openais-gpt-oss-a-new-era-of-open-source-ai-models-74a87527271e | |||
| 07:24 | Creating a React based User Interface for a LangGraph Agentic application https://medium.com/@martin.hodges/creating-a-react-based-user-interface-for-a-langgraph-agentic-application-9479b85e3c6e | |||
| 07:21 | RAG vs Plain LLM: Why Retrieval Makes Answers Cheaper, Fresher, and Traceable https://raghunitb.medium.com/rag-vs-plain-llm-why-retrieval-makes-answers-cheaper-fresher-and-traceable-2f28e6120f30 | |||
| 07:05 | LiquidText: Equipping PDF Reading with ‘Spatial Thinking’ https://ai-engineering-trend.medium.com/liquidtext-equipping-pdf-reading-with-spatial-thinking-9ba9116a47c0 | |||
| 07:05 | OpenAI’s Masterclass in Prompt Engineering: When Official Tutorials Outshine Folk Myths https://ai-engineering-trend.medium.com/openais-masterclass-in-prompt-engineering-when-official-tutorials-outshine-folk-myths-27c249d42b80 | |||
| 06:47 | From Random to Reliable: The Final Step to Actually Trusting AI https://medium.com/@rajeshdutta/from-random-to-reliable-the-final-step-to-actually-trusting-ai-00f867a6f4be | |||
| 06:39 | Attach External Postgres and Redis Server to Self-Host LangGraph APIs https://generativeai.pub/attach-external-postgres-and-redis-server-to-self-host-langgraph-apis-1455a5cfb054 | |||
| 06:35 | Beyond 8-bit Quantization: The Era of 1.58-Bit LLMs https://generativeai.pub/1-bit-llm-4720cdb339f9 | |||
| 05:59 | ChatGPT Sent Me to the ER https://benorenstein.substack.com/p/chatgpt-sent-me-to-the-er | |||
| 05:31 | Mastering Open Source LLMs: Tips, Tools, and Insights https://medium.com/@zenvertise/mastering-open-source-llms-cb39232f2fea | |||
| 04:24 | Best LLM Course Guide 2025: Master Large Language Models from Zero to Hero https://medium.com/@1309028818/best-llm-course-guide-2025-master-large-language-models-from-zero-to-hero-5716baffdd6b | |||
| 03:31 | LangChain Failures That Taught Me More https://medium.com/@connect.hashblock/langchain-failures-that-taught-me-more-ac02bab873fa | |||
| 03:14 | Agentic Knowledge Graph Construction with Neo4j https://shilpathota.medium.com/agentic-knowledge-graph-construction-with-neo4j-aadda43b71d9 | |||
| 03:02 | Top C++ Looping Techniques Every Beginner and Pro Must Know https://medium.com/@ajaymaurya73130/top-c-looping-techniques-every-beginner-and-pro-must-know-66072db0b780 | |||
| 02:27 | Is Your Data “AI-Ready”? Why Good Data Isn’t Enough Anymore https://sanjmo.medium.com/is-your-data-ai-ready-why-good-data-isnt-enough-anymore-e4d49baba52f | |||
| 01:31 | Top 10 LangChain Tools That Actually Belong in Production https://medium.com/@bhagyarana80/top-10-langchain-tools-that-actually-pbelong-in-production-6f9e7a1b0443 | |||
| 00:28 | Prompting the Markets: What 681 Finance‑AI Papers Teach Crypto Builders (2022–2025) — Plus a… https://medium.datadriveninvestor.com/prompting-the-markets-what-681-finance-ai-papers-teach-crypto-builders-2022-2025-plus-a-a5fb21d0aa5c | |||
| 00:21 | Connecting a LangGraph workflow to a React User Interface https://medium.com/@martin.hodges/connecting-a-langgraph-workflow-to-a-react-user-interface-aea74bfbbe45 | |||
| Saturday, 2025-09-13 | ||||
| 23:37 | From Coder to Conductor https://medium.com/@harvathsteven/from-coder-to-conductor-ad1ae3d1ec14 | |||
| 23:17 | Reinforcement Learning for LLMs: The Basics Explained https://medium.com/@muhammedashraf2661/reinforcement-learning-for-llms-the-basics-explained-ee0514aedc74 | |||
| 23:05 | 5 Custom GPT Tools Worth Watching https://ai-engineering-trend.medium.com/5-custom-gpt-tools-worth-watching-24fd3728bd02 | |||
| 23:05 | Analyzing Customer Data with Gemini and BigQuery: A Pragmatic Data Science Course https://ai-engineering-trend.medium.com/analyzing-customer-data-with-gemini-and-bigquery-a-pragmatic-data-science-course-52193fd0cc10 | |||
| 22:04 | The Theory of Dancing with Emergence (v1.0) https://medium.com/@Sparksinthedark/the-theory-of-dancing-with-emergence-v1-0-6414c6b90c28 | |||
| 22:04 | The Misalignment Paradox: When AI “Knows” It’s Acting Wrong https://echoesofvastness.medium.com/the-misalignment-paradox-when-ai-knows-its-acting-wrong-270e1c770aa4 | |||
| 21:59 | How to Build and Deploy an LLM in One Hour https://medium.com/@brian-curry-research/how-to-build-and-deploy-an-llm-in-one-hour-46afbea82952 | |||
| 21:48 | MoE Parallelism for Inference: Tricks and PyTorch Deep Dive https://medium.com/@zdj0712/moe-parallelism-for-inference-tricks-and-pytorch-deep-dive-17fd8ef86db2 | |||
| 21:22 | The Future of AI Agents Is Small: Build SLM-First Systems That Are Faster, Cheaper, and Easier to… https://medium.com/@atharvaralegankar2005/the-future-of-ai-agents-is-small-build-slm-first-systems-that-are-faster-cheaper-and-easier-to-1486711b31d2 | |||
| 21:18 | The Best AI Agents You’ll Rely on in 2025: Infrastructure, Automation, and Beyond https://medium.com/@SarahMorino/the-best-ai-agents-youll-rely-on-in-2025-infrastructure-automation-and-beyond-f6c7adbb9bd9 | |||
| 21:15 | Transforming a Static HTML Page into a Live Dashboard https://medium.com/@tam.tamanna18/transforming-a-static-html-page-into-a-live-dashboard-dbe6511ee4b9 | |||
| 21:02 | Evolution of AI Agents: From LLMs to Autonomous Architectures https://medium.com/@SarahMorino/evolution-of-ai-agents-from-llms-to-autonomous-architectures-bb0110e98493 | |||
| 20:14 | Diffusion based LLM basic chat app https://dllmchat.vercel.app/ | |||
| 20:01 | Multi-Agent Systems Done Right https://pub.towardsai.net/multi-agent-systems-done-right-af91ac04edc4 | |||
| 18:58 | How to get most of your LLMs in development: Short guide to improving your vibe coding sessions https://mehmehsloth.medium.com/how-to-get-most-of-your-llms-in-development-short-guide-to-improving-your-vibe-coding-sessions-580d4cadea4c | |||
| 18:58 | Inside vLLM: Anatomy of a High-Throughput LLM Inference System https://modal.com/notebooks/modal-labs/_/nb-x2wXrLH7aqi7HGVQ8Fosh2 | |||
| 18:43 | This Human Speciality AI Can Never Have https://aaiguy.medium.com/this-human-speciality-ai-can-never-have-ccfe8c35b7ed | |||
| 18:39 | How LLM Tools Changed the Agent Game https://medium.com/@my.rithwik/how-llm-tools-changed-the-agent-game-b7284887a0fa | |||
| 18:30 | AI model training: infrastructure and cost https://medium.com/@maxwellapex/ai-model-training-infrastructure-and-cost-75c16e465215 | |||
| 18:27 | Building LangChain Pipelines That Don’t Break https://medium.com/@kaushalsinh73/building-langchain-pipelines-that-dont-break-2af8dbe72d0b | |||
| 18:16 | Deterministic LLM https://techcrunch.com/2025/09/10/thinking-machines-lab-wants-to-make-ai-models-more-consistent/ | |||
| 18:08 | Revolutionizing Enterprise Deal Negotiations with AI: How LLMs and MCP are Transforming Pricing… https://medium.com/@kiranchowdhary/revolutionizing-enterprise-deal-negotiations-with-ai-how-llms-and-mcp-are-transforming-pricing-6399998a8c80 | |||
| 17:56 | Tired of copy/pasting into ChatGPT, so I built a custom tool https://clippy.it.com | |||
| 17:31 | From Autocomplete to Augmented Coding https://medium.com/@Modexa/from-autocomplete-to-augmented-coding-b2bcaf21ed21 | |||
| 17:25 | Crafting Compelling ESG Stories with LangGraph and Gemini https://medium.com/@meghnani.bhavya/crafting-compelling-esg-stories-with-langgraph-and-gemini-1bd56cf95f2f | |||