LLM News and Articles
| Friday, 2025-09-12 | ||||
| 03:55 | ImportSnare: Directed “Code Manual” Hijacking in Retrieval-Augmented Code Generation (ImportSnare… https://medium.com/@mdpman/importsnare-directed-code-manual-hijacking-in-retrieval-augmented-code-generation-importsnare-cff99216ef2b | |||
| 03:36 | Upscaling Generative AI System with DeepSeek https://yashvaantlakham73.medium.com/upscaling-generative-ai-system-with-deepseek-b968baf9c0e3 | |||
| 03:31 | 7 LangChain Tooling Anti-Patterns That Melt Agents https://medium.com/@bhagyarana80/7-langchain-tooling-anti-patterns-that-melt-agents-3588643e9644 | |||
| 03:31 | Turn Natural Language into SQL Queries with LLMs (No Coding Needed) ➡️ https://medium.com/@atnofordatascience/turn-natural-language-into-sql-queries-with-llms-no-coding-needed-%EF%B8%8F-a30628fd8827 | |||
| 03:18 | NVIDIA TensorRT Model Optimizer: The Toolkit That Makes AI Models Lighter, Faster, and Cheaper https://medium.com/coding-nexus/nvidia-tensorrt-model-optimizer-the-toolkit-that-makes-ai-models-lighter-faster-and-cheaper-81917111d6ac | |||
| 03:02 | Use Qwen3-Coder-480B-A35B-Instruct in Claude Code: A Practical Guide https://medium.com/@marketing_novita.ai/use-qwen3-coder-480b-a35b-instruct-in-claude-code-a-practical-guide-838662acbda0 | |||
| 03:02 | How to Access GPT-OSS-20B? Flexible Deployment with Ease https://medium.com/@marketing_novita.ai/how-to-access-gpt-oss-20b-flexible-deployment-with-ease-e949b39da043 | |||
| 02:57 | Why do GPTs Hallucinate? https://medium.com/@storybydhanush/why-do-gpts-hallucinate-caee008c39c4 | |||
| 02:54 | Principles of Prompting LLMs: From Basics to Breakthroughs https://medium.com/fundamentals-of-artificial-intelligence/principles-of-prompting-llms-from-basics-to-breakthroughs-b353fd3c27c5 | |||
| 02:51 | Encyclopedia Britannica sues Perplexity over AI 'answer engine' https://www.reuters.com/legal/litigation/encyclopedia-britannica-sues-perplexity-over-ai-answer-engine-2025-09-11/ | |||
| 02:31 | When to Use LangChain Over Direct API Calls https://medium.com/@kaushalsinh73/when-to-use-langchain-over-direct-api-calls-0b79536e3363 | |||
| 02:31 | Decoder-Only Transformers: Basis of GPT models https://medium.com/fundamentals-of-artificial-intelligence/decoder-only-transformers-basis-of-gpt-models-75ae4f254d95 | |||
| 02:28 | This is How We Debug LLM Training Data https://medium.com/fundamentals-of-artificial-intelligence/this-is-how-we-debug-llm-training-data-6eada34738bb | |||
| 02:06 | LLM-optimizer: Benchmark and optimize LLM inference across frameworks with ease https://github.com/bentoml/llm-optimizer | |||
| 01:43 | Why Small Language Models are the Future of Agentic AI https://devopslearning.medium.com/why-small-language-models-are-the-future-of-agentic-ai-57ef54a4648d | |||
| 01:18 | OpenAI Takes Big Steps Toward Its Long-Planned Reorganization https://www.nytimes.com/2025/09/11/technology/openai-microsoft-deal.html | |||
| 00:59 | Compound AI Systems https://surenk.medium.com/compound-ai-systems-7b44485dd741 | |||
| 00:31 | ChatGPT vs Claude — the real difference is in how they handle conversations https://wjung.medium.com/chatgpt-vs-claude-the-real-difference-is-in-how-they-handle-conversations-fd7a37dfd80d | |||
| 00:24 | MoVer: A tool that creates animations via LLM-based iterative refinement https://mover-dsl.github.io/ | |||
| 00:01 | Evaluating Large Language Models: What, Why, and How for Chatbots https://pub.towardsai.net/evaluating-large-language-models-what-why-and-how-for-chatbots-e807ed65a51e | |||
| Thursday, 2025-09-11 | ||||
| 23:58 | Building an Autonomous AI Research Agent with LangGraph and RAG https://medium.com/@maheshwari.sagars2000/building-an-autonomous-ai-research-agent-with-langgraph-and-rag-a06807f62343 | |||
| 23:54 | A joint statement from Microsoft and OpenAI https://blogs.microsoft.com/blog/2025/09/11/a-joint-statement-from-microsoft-and-openai/ | |||
| 23:51 | Speculative cascades – A hybrid approach for smarter, faster LLM inference https://research.google/blog/speculative-cascades-a-hybrid-approach-for-smarter-faster-llm-inference/ | |||
| 23:12 | Pausing at the Frontier: The Story of Asilomar, CRISPR, and AI https://medium.com/@paddub/self-regulation-in-science-asilomar-crispr-and-ai-9188a2f47bf6 | |||
| 23:05 | WeKnora: Tencent’s Open-Source Document Understanding and Retrieval Framework https://ai-engineering-trend.medium.com/weknora-tencents-open-source-document-understanding-and-retrieval-framework-a21a8464b8f9 | |||
| 22:36 | Statement on OpenAI's Nonprofit and PBC https://openai.com/index/statement-on-openai-nonprofit-and-pbc/ | |||
| 22:28 | Your Cutest Productivity Hack https://medium.com/@leonaej4000/your-cutest-productivity-hack-a068d9643506 | |||
| 22:23 | Controlling Costs in Modern AI Architectures https://medium.com/@prasmit/controlling-costs-in-modern-ai-architectures-d0fbaa89190c | |||
| 22:04 | Building an LLM-Powered Email Classifier and Responder with LangGraph, Outlines, and Pydantic https://medium.com/@eroltak/building-an-llm-powered-email-classifier-and-responder-with-langgraph-outlines-and-pydantic-f1c2580c1e47 | |||
| 22:03 | OpenAI and Microsoft agree key terms in contract renegotiation https://www.ft.com/content/f7891fd7-4e13-4767-8c0c-5b90b6471154 | |||
| 22:03 | Prompt Injection: Some sloppy cheaters who left their evidence all over ArXiv https://statmodeling.stat.columbia.edu/2025/07/07/chatbot-prompts/ | |||
| 21:51 | Eroding Human Agency: The Power of Prompts https://medium.com/@leonaej4000/eroding-human-agency-the-power-of-prompts-e43d19338646 | |||
| 21:38 | A joint statement from OpenAI and Microsoft https://openai.com/index/joint-statement-from-openai-and-microsoft/ | |||
| 21:31 | Beyond Innovation: Building AI We Can Trust https://medium.com/@phoenixarjun007/beyond-innovation-building-ai-we-can-trust-2993076c8cf7 | |||
| 21:30 | The Paradox of Brilliance: Why Our Smartest AI Still “Bluffs” And How We Can Teach It True Humility https://medium.com/@AnthonyLaneau/the-paradox-of-brilliance-why-our-smartest-ai-still-bluffs-and-how-we-can-teach-it-true-humility-444ed7f6070a | |||
| 21:11 | How LTM (Long-Term Memory) is Redefining AI Agents https://medium.com/@sdouraya3/how-ltm-long-term-memory-is-redefining-ai-agents-81a9d87c83aa | |||
| 21:11 | The Fragility Paradox: When Humans Are Also “Prompt-Dependent” https://thegoodprogrammer.medium.com/the-fragility-paradox-when-humans-are-also-prompt-dependent-a06d5a77152f | |||
| 21:08 | From FastAPI APIs to Secure MCP Tools Authentication Essentials https://medium.com/@tam.tamanna18/from-fastapi-apis-to-secure-mcp-tools-authentication-essentials-5d2053f33ace | |||
| 21:01 | An AI Resume Tailoring Agent with Python and Streamlit https://medium.com/@avinashkella/an-ai-resume-tailoring-agent-with-python-and-streamlit-aec2afc94ce3 | |||
| 21:01 | Why AI Keeps Lying to Us (And What We Can Do About It) https://oguzhankocakli.medium.com/why-ai-keeps-lying-to-us-and-what-we-can-do-about-it-ea22d58f87d0 | |||
| 20:38 | LLMOps is the Future — The Next DevOps Revolution https://medium.com/@balapm27/llmops-is-the-future-the-next-devops-revolution-f9bc5ff1297c | |||
| 20:33 | NEWSLETTER | South Korea strengthens AI strategy, Zhipu AI enticing Claude users to migrate https://medium.com/state-of-voice-portal/newsletter-south-korea-strengthens-ai-strategy-zhipu-ai-enticing-claude-users-to-migrate-a4644031d634 | |||
| 20:24 | Writing effective tools for LLM agents–using LLM agents https://www.anthropic.com/engineering/writing-tools-for-agents | |||
| 20:20 | Teaching Vector Embedding to my Mom https://medium.com/@abhishek1331975/teaching-vector-embedding-to-my-mom-da06cf3a3fbd | |||
| 20:16 | A New Era for Healthcare — When AI Meets Healthcare Data https://medium.com/@jigarmehta277/a-new-era-for-healthcare-when-ai-meets-healthcare-data-dff90c38073f | |||
| 20:15 | The Beginner’s Guide to Machine Learning for Programmers https://medium.com/@Niamh-Wordcast/the-beginners-guide-to-machine-learning-for-programmers-40eb0906fa42 | |||
| 19:59 | ✈️Tech Thursdays: Meet CrewAI — the multi-agent framework that thinks in teams https://medium.com/@gautsoni/%EF%B8%8Ftech-thursdays-meet-crewai-the-multi-agent-framework-that-thinks-in-teams-bf09087cb241 | |||
| 19:55 | OpenAI, Oracle sign 0B computing deal https://www.reuters.com/technology/openai-oracle-sign-300-billion-computing-deal-wsj-reports-2025-09-10/ | |||
| 19:43 | Challengers: An Overview of Post-Transformer Large-Scale Model Technologies https://maximliu-85602.medium.com/challengers-an-overview-of-post-transformer-large-scale-model-technologies-df5bcff2eda9 | |||
| 19:38 | Hands-On HPC Tips: What I Learned Training Language Models for My Dissertation https://beromkoh.medium.com/hands-on-hpc-tips-what-i-learned-training-language-models-for-my-dissertation-83f08a540f9a | |||
| 19:32 | Mamba for Dummies: Linear-Time LLMs Explained https://michielh.medium.com/mamba-for-dummies-linear-time-llms-explained-0d4b51efcf9f | |||
| 19:30 | AI Moderation: Inconsistencies in Hate Speech Detection Across LLM-Based Systems https://aclanthology.org/2025.findings-acl.1144/ | |||
| 19:27 | Taste Still Matters: Why Software Engineers Need More Than AI Skills in 2025 https://medium.com/data-science-collective/taste-still-matters-why-software-engineers-need-more-than-ai-skills-in-2025-d227add52d36 | |||
| 19:17 | When Investors Said “Wow!” Designing India Index from first principles https://medium.com/@saurabhswami/when-investors-said-wow-designing-india-index-from-first-principles-fe1301a021d6 | |||
| 19:04 | Writing in the age of LLMs: where the act of expression is the meaning itself https://medium.com/@ConeCells16/writing-in-the-age-of-llms-where-the-act-of-expression-is-the-meaning-itself-1e8f58c10ea0 | |||
| 18:55 | Claude’s memory architecture is the opposite of ChatGPT’s https://www.shloked.com/writing/claude-memory | |||
| 18:54 | Beyond the Spinner: Designing Fast, Sharp, and Reliable Multi-Agent AI Systems https://medium.com/@venkataSa1/beyond-the-spinner-designing-fast-sharp-and-reliable-multi-agent-ai-systems-9ea6eeea43e8 | |||
| 18:49 | LangGraph: The Power to Unlock Agentic AI https://medium.com/@p4prince2/langgraph-the-power-to-unlock-agentic-ai-9f8eb966955b | |||
| 18:27 | Building a Custom Tool for LangChain Agents https://medium.com/@kaushalsinh73/building-a-custom-tool-for-langchain-agents-2d5460921c93 | |||
| 18:21 | Build Your First Local AI Agent with the Model Context Protocol (MCP) https://vikramsamal.medium.com/build-your-first-local-ai-agent-with-the-model-context-protocol-mcp-08c0b6a1971c | |||
| 18:13 | I Trained an LLM on Stack Overflow: It Learned to Be as Toxic as the Community https://medium.com/@sohail_saifii/i-trained-an-llm-on-stack-overflow-it-learned-to-be-as-toxic-as-the-community-a4b3a088e27a | |||
| 18:07 | Mathematical research with GPT-5: a Malliavin-Stein experiment https://arxiv.org/abs/2509.03065 | |||
| 18:02 | We’ve Been Measuring AI Reasoning All Wrong. Here’s How to Fix It. https://pub.towardsai.net/weve-been-measuring-ai-reasoning-all-wrong-here-s-how-to-fix-it-7f11af09ac14 | |||
| 17:57 | Artificial Intelligence: Replacement or Reinforcement. #AiForHumans. https://medium.com/the-silent-script/artificial-intelligence-replacement-or-reinforcement-aiforhumans-affb89ee0a0a | |||
| 17:39 | Why AI Hallucinates — And How MIT’s “Semantic Firewall” Wants to Fix It https://abvcreative.medium.com/why-ai-hallucinates-and-how-mits-semantic-firewall-wants-to-fix-it-72e39d83ae25 | |||
| 17:38 | Qwen3-Next: Towards Ultimate Training and Inference Efficiency https://qwen.ai/blog | |||
| 17:21 | On Tokenization — Learning the Complexities https://medium.com/@rajanbhateja6/on-tokenization-learning-the-complexities-4e3aa66ba40b | |||
| 17:16 | Bias in LLMs: How It Happens https://medium.com/genai-llms/bias-in-llms-how-it-happens-0c3ab76ccebd | |||
| 17:11 | On Word Embeddings & Vector Databases — Storing More than Just Words https://medium.com/@rajanbhateja6/on-word-embeddings-vector-databases-storing-more-than-just-words-cdcbd03cbf94 | |||
| 16:54 | How to turn Claude Code into a domain specific coding agent https://blog.langchain.com/how-to-turn-claude-code-into-a-domain-specific-coding-agent/ | |||
| 16:45 | Zonos-Hebrew: Fine-Tuning Zonos on SASPEECH with a Phonikud Phoneme Pipeline https://medium.com/@maxme006/zonos-hebrew-fine-tuning-zonos-on-saspeech-with-a-phonikud-phoneme-pipeline-397e6d5717c8 | |||
| 16:30 | MCP — The Missing Elixir for LLMs https://medium.com/@yaswanthmitta/mcp-the-missing-elixir-for-llms-17a6726b75eb | |||
| 16:26 | The Three Core Skills Every AI Engineer Actually Needs in 2025 https://ai.plainenglish.io/the-three-core-skills-every-ai-engineer-actually-needs-in-2025-ab9acff651e3 | |||
| 16:26 | The Hidden Truth Behind AI’s Inconsistency: Thinking Machines Reveals the Root Cause and… https://medium.com/aimonks/the-hidden-truth-behind-ais-inconsistency-thinking-machines-reveals-the-root-cause-and-cbaf3ba39802 | |||
| 15:56 | How to Write Prompts: 7 Steps to Unlock AI’s Full Potential in 2025 https://medium.com/@RendonMx/how-to-write-prompts-7-steps-to-unlock-ais-full-potential-in-2025-7bdf7f41984e | |||
| 15:34 | Süni İntellekt, Maşın Öyrənməsi, Dərin Öyrənmə və Generativ Süni İntellektə Baxış https://medium.com/@aiselmammedova/s%C3%BCni-i%CC%87ntellekt-ma%C5%9F%C4%B1n-%C3%B6yr%C9%99nm%C9%99si-d%C9%99rin-%C3%B6yr%C9%99nm%C9%99-v%C9%99-generativ-s%C3%BCni-i%CC%87ntellekt%C9%99-bax%C4%B1%C5%9F-35258c5597b8 | |||
| 15:10 | Paragen Technical Delivery Roadmap for Q3–Q4 2025 https://medium.com/@Parallelai_blog/paragen-technical-delivery-roadmap-for-q3-q4-2025-e1374a3bf939 | |||
| 15:06 | Show HN: Asxiv.org – Ask ArXiv papers questions through chat https://asxiv.org/ | |||
| 15:05 | When ‘Environment’ Becomes ‘Evaluation’: The Semantic Inflation of AI Terminology https://ai-engineering-trend.medium.com/when-environment-becomes-evaluation-the-semantic-inflation-of-ai-terminology-bd646915d1a3 | |||
| 15:05 | NotebookLM Updates FAQ and Timeline Features, But User Experience Still Needs Improvement https://ai-engineering-trend.medium.com/notebooklm-updates-faq-and-timeline-features-but-user-experience-still-needs-improvement-543d283b8083 | |||
| 15:01 | LAI #92: AI Hype vs. Reality, Deepfake Detection, and Copilot+ PCs https://pub.towardsai.net/lai-92-ai-hype-vs-reality-deepfake-detection-and-copilot-pcs-8e01402c802c | |||
| 15:01 | LLMs: Should You Prompt, RAG, or Fine-Tune? https://medium.com/@bhargavi_guddati/llms-should-you-prompt-rag-or-fine-tune-9387ecb183d4 | |||
| 14:56 | Crafting Multi-Agent RAG Systems with DSPy and GEPA Optimization https://medium.com/@tam.tamanna18/crafting-multi-agent-rag-systems-with-dspy-and-gepa-optimization-363e74e54bea | |||
| 14:46 | How Enterprises Can Audit Their AI Visibility https://medium.com/@tim_62250/how-enterprises-can-audit-their-ai-visibility-fef43ab36716 | |||
| 14:42 | Network and Storage Benchmarks for LLM Training on the Cloud https://maknee.github.io/blog/2025/Network-And-Storage-Training-Skypilot/ | |||
| 14:13 | “Persistence ≈ Creation”: Why Cooperative Intelligence Can Spread by Natural Law https://medium.com/@omanyuk/persistence-creation-why-cooperative-intelligence-can-spread-by-natural-law-a143988ec942 | |||
| 14:06 | The AI Banana That’s Eating Photoshop’s Lunch https://medium.com/write-a-catalyst/the-ai-banana-thats-eating-photoshop-s-lunch-11698b843082 | |||
| 13:56 | <The Misfit at Tech’s Cool Kids Table: Why Artists Are Indispensable in the AI Revolution> https://medium.com/@fernandofula.art/the-misfit-at-techs-cool-kids-table-why-artists-are-indispensable-in-the-ai-revolution-c0aec4ff3224 | |||
| 13:34 | AI Mode: how it works and what it means for Ukrainian SEO https://medium.com/@hostpro.ua/ai-mode-how-it-works-and-what-it-means-for-ukrainian-seo-d76c5e22f1a6 | |||
| 12:52 | LLM’s Simplified — Language Modelling and Decoding https://sampathkumaran.medium.com/llms-simplified-language-modelling-and-decoding-2402ae5eb85c | |||
| 12:52 | From LLMs(Large Language Models) to LCMs( Large Concept Models) https://www.towardsdeeplearning.com/from-llms-large-language-models-to-lcms-large-concept-models-39c42b964348 | |||
| 12:44 | How GPUs Revolutionize Vector Search: CUDA, cuVS, and Faiss in Action https://medium.com/mlworks/how-gpus-revolutionize-vector-search-cuda-cuvs-and-faiss-in-action-ac2f5dc6c410 | |||
| 12:43 | Small LLMs: When to Prefer 1–8B Models, LoRA/QLoRA, and Low-VRAM Finetuning Recipes https://medium.com/@hritikrai55/small-llms-when-to-prefer-1-8b-models-lora-qlora-and-low-vram-finetuning-recipes-333fd2df8a62 | |||
| 12:37 | Why RAG is Like a Triple Espresso Shot☕ for Your AI: The Caffeine Boost Your Chatbot Didn’t Know… https://medium.com/@krishnajamora4007/why-rag-is-like-a-triple-espresso-shot-for-your-ai-the-caffeine-boost-your-chatbot-didnt-know-96ac08feb0cd | |||
| 12:31 | A quick take on K8s 1.34 GA DRA: 7 questions you probably have https://blog.devops.dev/a-quick-take-on-k8s-1-34-ga-dra-7-questions-you-probably-have-e981966f06c7 | |||
| 12:31 | The Free AI Tool They Don’t Want You to Know About: All LLMs at One Place https://lifeindraft.medium.com/the-free-ai-tool-they-dont-want-you-to-know-about-all-llms-at-one-place-6f5e754079dc | |||
| 12:14 | A deeper look into using MCP in the enterprise https://medium.com/dsaid-govtech/a-deeper-look-into-using-mcp-in-the-enterprise-d0200915550b | |||
| 12:10 | Supercharge Your Sentence Embeddings: A Tale of Two Loss Functions https://medium.com/@cd_24/supercharge-your-sentence-embeddings-a-tale-of-two-loss-functions-f325f88aab6a | |||
| 12:08 | Prompt Engineering: O Guia Definitivo para Dominar a Comunicação com IA https://medium.com/@mathcoimbr4/prompt-engineering-o-guia-definitivo-para-dominar-a-comunica%C3%A7%C3%A3o-com-ia-750110c09f1e | |||
| 12:05 | When Words Learn to See https://ai.gopubby.com/when-words-learn-to-see-940b1baac63e | |||
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124