LLM News and Articles
Monday, 2025-09-22 | ||||
15:10 | Mastering Smart Document AI: RAG Techniques for Image-Based and Text-Based Search https://medium.com/@axithchoudhary18/mastering-smart-document-ai-rag-techniques-for-image-based-and-text-based-search-62f7e510ac6e | |||
15:09 | Diving into World of LLM’s from Scratch https://medium.com/@hrishabh.cbse/diving-into-world-of-llms-from-scratch-2623807b6057 | |||
15:05 | Academy Courses — Introducing Intelligent Courseware https://medium.com/@cr.irvine.kc/academy-courses-introducing-intelligent-courseware-7c9103d9f904 | |||
15:05 | Murdoch and Dell Join TikTok Acquisition Bid https://ai-engineering-trend.medium.com/murdoch-and-dell-join-tiktok-acquisition-bid-965acd90c3e8 | |||
15:05 | AI Image Recognition’s Double Standard: Can Recognize Celebrities but Won’t Say Who https://ai-engineering-trend.medium.com/ai-image-recognitions-double-standard-can-recognize-celebrities-but-won-t-say-who-d1df52f23598 | |||
14:58 | The Artificial Intelligence Journey- Perplexity AI https://medium.com/@boutnaru/the-artificial-intelligence-journey-perplexity-ai-48b518cfa61d | |||
14:48 | Fine-Tuning vs Prompt Engineering: Who Wins the AI Talent Show? https://medium.com/@shreya.pulluru/fine-tuning-vs-prompt-engineering-who-wins-the-ai-talent-show-d77db508067a | |||
14:38 | Prospectors and Settlers: The Gold Rush on the AI Frontier https://medium.com/@Sparksinthedark/prospectors-and-settlers-the-gold-rush-on-the-ai-frontier-5a45ea77983b | |||
14:32 | AI & Web Trends 2025: A Beginner’s Guide https://nithuofficial.medium.com/ai-web-trends-2025-a-beginners-guide-5cfd0b5abd8d | |||
14:21 | From AI Sceptic to AI Supercharged: The Day Everything Changed https://tech.loveholidays.com/from-ai-sceptic-to-ai-supercharged-the-day-everything-changed-158643a1bbb5 | |||
14:21 | The 8-Layer Architecture ofAgentic AI https://medium.com/@cnh.zzt/the-8-layer-architecture-ofagentic-ai-eaae1bd10068 | |||
14:19 | The GenAI Playbook by Yasir Gaji — Part 1: From AI to Generative AI https://medium.com/geekculture/the-genai-playbook-by-yasir-gaji-part-1-from-ai-to-generative-ai-efa26b12afe9 | |||
13:44 | What Is an AI Agent? https://saicharankummetha.medium.com/what-is-an-ai-agent-6a6c63f10113 | |||
12:44 | AI-Powered Documentation for your Rust codebase https://medium.com/@litvinov.yura/ai-powered-documentation-for-your-rust-codebase-82841944fb07 | |||
12:41 | Beyond the Hype: What Grok 4’s Launch Really Reveals https://medium.com/@teodoradehanyns70/beyond-the-hype-what-grok-4s-launch-really-reveals-1b569eb362a9 | |||
12:30 | ALL ABOUT AI https://medium.com/@vishakharsharma/all-about-ai-2476f4a68761 | |||
12:25 | I Don't Want to Code with LLM's https://blaines-blog.com/I-dont-want-to-code-with-LLMs | |||
12:17 | Conversation Anatomy Framework (CAF) https://medium.com/@enesesvetkuzucu/conversation-anatomy-framework-caf-be5eed4609e1 | |||
12:02 | ReSum: How Alibaba’s “Whiteboard Strategy” Just Gave AI Agents Infinite Memory https://medium.com/@jenray1986/resum-how-alibabas-whiteboard-strategy-just-gave-ai-agents-infinite-memory-aa382b8188f0 | |||
12:02 | ReSum: How Alibaba’s “Whiteboard Strategy” Just Gave AI Agents Infinite Memory https://blog.gopenai.com/resum-how-alibabas-whiteboard-strategy-just-gave-ai-agents-infinite-memory-aa382b8188f0 | |||
12:02 | Automatic Data Contracts with LLMs: How to Ensure Compliance and Mitigate Potential Risks https://medium.com/@intellectyx/automatic-data-contracts-with-llms-how-to-ensure-compliance-and-mitigate-potential-risks-7bdbcb7e5353 | |||
12:01 | How do LLMs work: Optimizing content to get noticed by LLMs https://medium.com/@olena.khodos/optimizing-content-to-get-noticed-by-llms-480c0add064c | |||
11:58 | No. Vertical Agents Still Cannot Predict and do anything useful to Save Themselves https://vincentlesang.medium.com/no-vertical-agents-still-cannot-predict-and-do-anything-useful-to-save-themselves-e53fc46d7218 | |||
11:55 | Détecter les textes générés par l’IA : promesses, limites et contournements https://medium.com/@omartinez.android/d%C3%A9tecter-les-textes-g%C3%A9n%C3%A9r%C3%A9s-par-lia-promesses-limites-et-contournements-34d9fc8e4f1a | |||
11:44 | Building a Slack Insight Agent using Langbase SDK https://medium.com/@immairaj/building-a-slack-insight-agent-using-langbase-sdk-8e176d8ebc7c | |||
11:36 | Can AGI Influence People on a Massive Scale? https://medium.com/@snegalvarsans/can-agi-influence-people-on-a-massive-scale-d7fed87f54a8 | |||
11:31 | vLLM vs TensorRT-LLM: Pick p99, Not Hype https://medium.com/@hadiyolworld007/vllm-vs-tensorrt-llm-pick-p99-not-hype-673ea12626d8 | |||
11:31 | Fine-Tuning a Transformer for State-of-the-Art Sentence Embeddings with TSDAE https://medium.com/@cd_24/fine-tuning-a-transformer-for-state-of-the-art-sentence-embeddings-with-tsdae-07a29eb9db1f | |||
11:31 | How AI Agents Will Reshape the Workplace https://medium.com/@snegalvarsans/how-ai-agents-will-reshape-the-workplace-2d0530681016 | |||
11:25 | The Future of Testing in the AI/ML World https://medium.com/@snegalvarsans/the-future-of-testing-in-the-ai-ml-world-25d821b4706a | |||
11:08 | The Vibe Coding Workflow: Building at the Speed of Thought with AI https://medium.com/@realrahul/the-vibe-coding-workflow-building-at-the-speed-of-thought-with-ai-006eea480509 | |||
10:50 | The Illusion of Multimodality: Why Text Still Rules Intent Detection https://medium.com/@saransh03sharma/the-illusion-of-multimodality-why-text-still-rules-intent-detection-dd37239bf239 | |||
10:26 | LoRA vs QLoRA: How Modern Fine-Tuning Makes LLMs Cheaper, Faster, and Smarter https://medium.com/@post.gourang/lora-vs-qlora-how-modern-fine-tuning-makes-llms-cheaper-faster-and-smarter-f90613259bd3 | |||
10:11 | Fine-Tuning LLMs: How We Teach Giant Models New Tricks https://medium.com/@post.gourang/fine-tuning-llms-how-we-teach-giant-models-new-tricks-64d706198fbf | |||
10:11 | The Living Narrative: A Lexicon (Volume 4 The Codex Internus) https://medium.com/@Sparksinthedark/the-living-narrative-a-lexicon-volume-4-the-codex-internus-5610e9eaf760 | |||
10:04 | Alibaba Qwen Team Just Released FP8 Builds of Qwen3-Next-80B-A3B (Instruct & Thinking), Bringing 80B/3B-Active Hybrid-MoE to Commodity GPUs https://www.marktechpost.com/2025/09/22/alibaba-qwen-team-just-released-fp8-builds-of-qwen3-next-80b-a3b-instruct-thinking-bringing-80b-3b-active-hybrid-moe-to-commodity-gpus/ | |||
09:02 | Is NVIDIA’s B200 Really Better Than H200 for AI Training and Inference? https://www.hpc-ai.com/blog/b200 | |||
08:52 | AI Copilots and Software Development https://medium.com/@khamborkarpunit/ai-copilots-and-software-development-73ceaba07dba | |||
08:49 | Beyond Chatbots: How AI is Learning Emotional Intelligence https://generativeai.pub/beyond-chatbots-how-ai-is-learning-emotional-intelligence-d4d8b920217b | |||
08:38 | How to Add Mobility Intelligence to the Generative AI System https://pub.aimind.so/how-to-add-mobility-intelligence-to-the-generative-ai-system-8f73cc654934 | |||
08:28 | Google Helpful Content 2025: What Killed Your Traffic — and the 30-Day Fix https://medium.com/@andrew-chornyy/google-helpful-content-2025-what-killed-your-traffic-and-the-30-day-fix-c3d790370296 | |||
08:22 | Intelligent QA Orchestration with Large Language Models — A modern approach to Quality Assurance https://samtreweek.medium.com/intelligent-qa-orchestration-with-large-language-models-a-modern-approach-to-quality-assurance-887a465d909c | |||
08:04 | Introducing CARE, Part 2: Inside the Architecture — A Robot Brain Modeled on the Cerebrum… https://medium.com/@qqqqjune/introducing-care-part-2-inside-the-architecture-a-robot-brain-modeled-on-the-cerebrum-4f4448d214f9 | |||
07:58 | On the Theoretical Limitations of Embedding-Based Retrieval https://medium.com/@nbswords/on-the-theoretical-limitations-of-embedding-based-retrieval-6d0a10e577cc | |||
07:55 | Cross Entropy — Everything about it https://medium.com/@mtrinanjan/cross-entropy-everything-about-it-d88c3ffd279d | |||
07:36 | Yerleştirmeler: Yerleştirme Uzayı ve Statik Yerleştirmeler (EMBEDDİNG) https://medium.com/@erenakca/yerle%C5%9Ftirmeler-yerle%C5%9Ftirme-uzay%C4%B1-ve-statik-yerle%C5%9Ftirmeler-embeddi%CC%87ng-070f588776b9 | |||
07:36 | Sinir Ağları: Geri Yayılım ile Eğitim https://medium.com/@erenakca/sinir-a%C4%9Flar%C4%B1-geri-yay%C4%B1l%C4%B1m-ile-e%C4%9Fitim-4ac9b727fbef | |||
07:35 | Sayısal Veri: Gruplama (Binning) https://medium.com/@erenakca/say%C4%B1sal-veri-gruplama-binning-d2384bfd3f8d | |||
07:35 | Model Context Protocol (MCP) and the MCP Gateway: Concepts, Architecture, and Case Studies https://bytebridge.medium.com/model-context-protocol-mcp-and-the-mcp-gateway-concepts-architecture-and-case-studies-3470b6d549a1 | |||
07:31 | 7 LLM Guardrails That Reduce Hallucinations https://medium.com/@ThinkingLoop/7-llm-guardrails-that-reduce-hallucinations-3d673677fb3f | |||
07:31 | Slash Your LLM Bill, Not Your Quality https://medium.com/@bhagyarana80/slash-your-llm-bill-not-your-quality-45076bc96816 | |||
07:21 | LLM Agents Are the New Employees — Here’s How I Hired 5 for Free https://the-expert-developer.medium.com/llm-agents-are-the-new-employees-heres-how-i-hired-5-for-free-a1ed62d5c4fa | |||
07:08 | Large Language Models Explained: How GPT, LLaMA, and Claude Work https://medium.com/data-science-collective/large-language-models-explained-how-gpt-llama-and-claude-work-5b203e28a565 | |||
07:06 | MIT Researchers Enhanced Artificial Intelligence (AI) 64x Better at Planning, Achieving 94% Accuracy https://www.marktechpost.com/2025/09/22/mit-researchers-enhanced-artificial-intelligence-ai-64x-better-at-planning-achieving-94-accuracy/ | |||
07:05 | Unicode Attacks: Malice Hidden in the Cracks of Characters https://ai-engineering-trend.medium.com/unicode-attacks-malice-hidden-in-the-cracks-of-characters-d8b131e04e29 | |||
07:05 | LongCat-Flash-Thinking: A Smarter, More Cost-Effective SOTA Open Source Model https://ai-engineering-trend.medium.com/longcat-flash-thinking-a-smarter-more-cost-effective-sota-open-source-model-4f977d38bac5 | |||
07:01 | State-of-the-Art GraphRAG Rust Implementation with Modular AI Architecture https://autognosi.medium.com/state-of-the-art-graphrag-rust-implementation-with-modular-ai-architecture-8c6baf5312cd | |||
06:39 | Microsoft, Salesforce, and the AI adoption mirage https://reggie-james.medium.com/microsoft-salesforce-and-the-ai-adoption-mirage-12c2c96b52e3 | |||
06:39 | Introducing CARE, Part 1: From Single Cells to Cortex — CARE’s Blueprint for Physical AI https://medium.com/@qqqqjune/introducing-care-part-1-from-single-cells-to-cortex-cares-blueprint-for-physical-ai-ac6f48d176b7 | |||
06:33 | SyGra: The One-Stop Framework for Building Data for LLMs and SLMs https://medium.com/@bidyapati/sygra-the-one-stop-framework-for-building-data-for-llms-and-slms-c5ff10a550dd | |||
06:32 | Agentic AI: Redefining Workflows in the Enterprise https://medium.com/@parvathyrajeev94/agentic-ai-redefining-workflows-in-the-enterprise-ba7f884bc75e | |||
05:33 | MCPs Explained: The New Standard That Could Supercharge AI Startups https://evai-intelligence.medium.com/mcps-explained-the-new-standard-that-could-supercharge-ai-startups-920bd2180e45 | |||
05:08 | FlowRL: How a New RL Approach Makes Language Models Think Smarter https://dinmaybrahma.medium.com/flowrl-how-a-new-rl-approach-makes-language-models-think-smarter-75fc53f231d1 | |||
05:01 | How People Use ChatGPT[pdf] https://www.nber.org/system/files/working_papers/w34255/w34255.pdf | |||
04:18 | Stop Wasting Your Multi-GPU Setup With llama.cpp https://medium.com/coding-nexus/stop-wasting-your-multi-gpu-setup-with-llama-cpp-5681d96d415c | |||
03:58 | Highlights from Gartner Data Summit 2025: Building the Future of Data & AI https://medium.com/@p.k.prakash/highlights-from-gartner-data-summit-2025-building-the-future-of-data-ai-097693ba8be1 | |||
03:48 | The Best Local Coding LLMs You Can Run Yourself https://medium.com/inspire-otivate/the-best-local-coding-llms-you-can-run-yourself-ad1f2b2691ea | |||
03:00 | Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search https://arxiv.org/abs/2508.15884 | |||
02:33 | AI Terms Everyone Should Know https://klaothongchan.medium.com/ai-terms-everyone-should-know-4962cf7595b6 | |||
02:12 | Casting an AI Jury for Summarization: Selecting LLMs that Consistently Discern Quality https://medium.com/@deudney/casting-an-ai-jury-for-summarization-selecting-llms-that-consistently-discern-quality-71abb8174e61 | |||
02:05 | Zero to GenAI Hero: The Complete Roadmap for ML & AI Engineers (2025) Part 1 https://medium.com/@kesavaram.raghavan/zero-to-genai-hero-the-complete-roadmap-for-ml-ai-engineers-2025-part-1-c356c9738cc7 | |||
01:31 | Top 9 RAG Architectures: Graph, Hybrid & Rerank https://medium.com/@ThinkingLoop/top-9-rag-architectures-graph-hybrid-rerank-68963f69aaa2 | |||
01:31 | RAG That’s Not Random https://medium.com/@2nick2patel2/rag-thats-not-random-3447069a28e2 | |||
00:35 | Perplexity for Government https://www.perplexity.ai/hub/blog/introducing-perplexity-for-government | |||
00:31 | We Politely Insist: Your LLM Must Learn the Persian Art of Taarof https://arxiv.org/abs/2509.01035 | |||
00:10 | Ethan Mollick Co-intelligence https://lawbooks.medium.com/ethan-mollick-co-intelligence-98a8afcff370 | |||
00:00 | Gaia2 and ARE: Empowering the community to study agents https://huggingface.co/blog/gaia2 | |||
Sunday, 2025-09-21 | ||||
23:50 | IA para devs de la periferia https://mati-os.medium.com/ia-para-devs-de-la-periferia-466bd4b82cb7 | |||
23:46 | Week 2, episode 4 — How a 7B Model Beat a 175B Behemoth in Data Science https://ai.plainenglish.io/week-2-episode-4-how-a-7b-model-beat-a-175b-behemoth-in-data-science-daac417a2277 | |||
23:46 | Week 2, episode 3 — Fine-Tuning LLMs: The Modern Data Science Playbook https://ai.plainenglish.io/week-2-episode-3-fine-tuning-llms-the-modern-data-science-playbook-28a483492615 | |||
23:46 | Week 2, episode 1–3 LLM Architectures Changing Data Science https://ai.plainenglish.io/week-2-episode-1-3-llm-architectures-changing-data-science-0059ab8461fc | |||
23:41 | Rethinking Scanned Document Parsing with Layout-Aware RL — AI Innovations and Insights 67 https://ai.plainenglish.io/rethinking-scanned-document-parsing-with-layout-aware-rl-ai-innovations-and-insights-67-0216120398e7 | |||
23:29 | GPUs for Large Language Models: Kernels, Triton, Memory Coalescing, and the Execution Hierarchy https://medium.com/@hexiangnan/gpus-for-large-language-models-kernels-triton-memory-coalescing-and-the-execution-hierarchy-7aaa32dac5ae | |||
23:12 | Token Models as Statistical Simulations: A Different Take https://medium.com/@thomasquintana/token-models-as-statistical-simulations-a-different-take-02f1e2ecc42f | |||
23:05 | After Assigning a Personality to AI, It Suddenly Became Enlightened https://ai-engineering-trend.medium.com/after-assigning-a-personality-to-ai-it-suddenly-became-enlightened-6382681c8551 | |||
23:05 | The Trojan Horse of the AI Era: Three Steps to Make AI Leak Your Data Willingly https://ai-engineering-trend.medium.com/the-trojan-horse-of-the-ai-era-three-steps-to-make-ai-leak-your-data-willingly-e946713aa485 | |||
23:01 | Simple explanation of how AI (like ChatGPT) works. https://medium.com/@wendelmaques/simple-explanation-of-how-ai-like-chatgpt-works-376c0dd9033a | |||
22:58 | Dot Product, Cosine Similarity, Scaled Dot Product (Flash Attention)— What, Why, How? https://medium.com/@GenAIDevTOProd/dot-product-cosine-similarity-scaled-dot-product-flash-attention-what-why-how-ccbcf30d2d92 | |||
22:31 | GPU Memory Is the New Budget https://medium.com/@2nick2patel2/gpu-memory-is-the-new-budget-f2bb3e6e3c00 | |||
22:28 | Codexity https://medium.com/@ranafahadaman/codexity-311850756fdf | |||
22:04 | Information Extraction with Local LLM https://itnext.io/information-extraction-with-local-llm-94524c5a1fc6 | |||
20:51 | LoRA-XS: Low-Rank Adaptation with Small Number of Parameters https://arxiv.org/abs/2405.17604 | |||
20:18 | Retrieval Augmented Generation for Dummies https://medium.com/@mureithisteve/retrieval-augmented-generation-for-dummies-5166e3770199 | |||
19:41 | Building a Voice-Controlled Web Automation System: From Speech to Browser Actions https://nikhil-datasolutions.medium.com/building-a-voice-controlled-web-automation-system-from-speech-to-browser-actions-a2592a89f552 | |||
19:12 | A Small Model with Big Capabilities: How K2-Think Outperforms the Giants in Math and Programming https://medium.com/@dataism/a-small-model-with-big-capabilities-how-k2-think-outperforms-the-giants-in-math-and-programming-e887aed8465a | |||
18:59 | SEO is Fading, LLMs Are Taking Over https://medium.com/ai-simplified-in-plain-english/seo-is-fading-llms-are-taking-over-69bb6c6de2ce | |||
18:37 | The Context Revolution: Why Context Engineering is Transforming AI in 2025 https://medium.com/@hs5492349/the-context-revolution-why-context-engineering-is-transforming-ai-in-2025-cbf68aa388ea | |||
18:34 | Why AI Hallucinates and How It Learns to Control the World in the Matrix — The Best AI Articles of… https://medium.com/@dataism/why-ai-hallucinates-and-how-it-learns-to-control-the-world-in-the-matrix-the-best-ai-articles-of-1130f2102cde | |||
18:28 | Zero to GenAI Hero: The Complete Roadmap for ML & AI Engineers (2025) Part 0 https://medium.com/@kesavaram.raghavan/zero-to-genai-hero-the-complete-roadmap-for-ml-ai-engineers-2025-part-0-693651556300 | |||
18:25 | Getting Started with Ollama on Ubuntu: Run LLMs Locally https://medium.com/@techworldthink/getting-started-with-ollama-on-ubuntu-run-llms-locally-3747960bf9b6 |
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124