LLM News and Articles
| Thursday, 2025-11-20 | ||||
| 20:42 | nanochat.karpathy.ai https://nanochat.karpathy.ai/ | |||
| 19:55 | Early science acceleration experiments with GPT-5 [pdf] https://cdn.openai.com/pdf/4a25f921-e4e0-479a-9b38-5367b47e8fd0/early-science-acceleration-experiments-with-gpt-5.pdf | |||
| 19:08 | VLM Showdown: GPT vs. Gemini vs. Claude vs. Orion https://chat.vlm.run/showdown | |||
| 18:41 | Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference https://github.com/docker/model-runner | |||
| 16:47 | If an LLM Could Phone a Friend, It would call RAG https://medium.com/@sivaniverse/if-an-llm-could-phone-a-friend-it-would-call-rag-fb25944ce291 | |||
| 16:24 | GPT-5.1 vs Gemini 3: Why GPT-5.1 Tops Long-Context and Instruction-Following Benchmarks https://medium.com/data-science-in-your-pocket/gpt-5-1-vs-gemini-3-why-gpt-5-1-tops-long-context-and-instruction-following-benchmarks-cdfc60217abf | |||
| 16:15 | 1T Parameters in Top LLMs: The Secret to Smarter AI https://medium.com/@vikramlingam/1t-parameters-in-top-llms-the-secret-to-smarter-ai-0680284f7803 | |||
| 16:08 | Beyond Chatbots: Building a Private, Grounded Q&A System with RAG and an Open-Source LLM. https://medium.com/@dhanushree.n2004/beyond-chatbots-building-a-private-grounded-q-a-system-with-rag-and-an-open-source-llm-d73bcf9a0ccc | |||
| 16:07 | The Era of LLMs: Use Cases That Truly Matter https://medium.com/@manojdursoju/the-era-of-llms-use-cases-that-truly-matter-635ae0f9d4e9 | |||
| 16:05 | The Mathematical Paradox of Mixture of Experts https://pub.towardsai.net/the-mathematical-paradox-of-mixture-of-experts-7cee590ec667 | |||
| 16:05 | OpenAI Releases GPT-5.1-Codex-Max, a Programming Model Capable of Continuous 24-Hour Operation https://ai-engineering-trend.medium.com/openai-releases-gpt-5-1-codex-max-a-programming-model-capable-of-continuous-24-hour-operation-fa2acdec5a5d | |||
| 15:57 | Testing Gemini 3 Pro Image https://medium.com/google-cloud/testing-gemini-3-pro-image-f585236ae411 | |||
| 15:55 | Google Antigravity is here! https://medium.com/@mattcielecki/google-antigravity-is-here-5b655789c665 | |||
| 15:52 | Retrieval-augmented Generation: Part 3 https://billtcheng2013.medium.com/retrieval-augmented-generation-part-3-834a4f30442f | |||
| 15:45 | Why is hybrid search necessary? Isn’t vector search with embeddings sufficient? https://medium.com/coding-nexus/why-is-hybrid-search-necessary-isnt-vector-search-with-embeddings-sufficient-d312767b590d | |||
| 15:26 | Host open-source LLM on a local server and access it Publicly https://ibjects.medium.com/host-open-source-llm-local-server-access-public-950f48c6858e | |||
| 15:16 | Teaming LLMs to Fight Hallucinations: A Deep Dive into a New Frontier of Model Reliability https://medium.com/@iyinusa/teaming-llms-to-fight-hallucinations-a-deep-dive-into-a-new-frontier-of-model-reliability-5c81a0b12449 | |||
| 15:03 | LAI #102: Smaller Models, Smarter Systems, and the Math Behind Kimi K2 https://pub.towardsai.net/lai-102-smaller-models-smarter-systems-and-the-math-behind-kimi-k2-203feaa386fa | |||
| 14:57 | OpenAI can't beat Google in consumer AI https://nextword.substack.com/p/openai-cant-beat-google-in-consumer | |||
| 14:46 | Hot take: LLM "guardrails" are worthless and will always be ineffective https://infosec.exchange/@munin/115555225791856918 | |||
| 14:45 | How copyright issues shaped vampire lore:Analyzing Dracula vs Nosferatu with NLP https://dontlognow.substack.com/p/how-a-copyright-infringement-case | |||
| 14:41 | LLMs Will Deflate. GPUs Will Correct. AI Will … Bubble? https://medium.com/@ilya.lank/llms-will-deflate-gpus-will-correct-ai-will-bubble-9b843d660267 | |||
| 14:38 | AI boyfriend free-roam mode: The 10-Minute Weathergirl Method https://medium.com/@weathergirl666/ai-boyfriend-free-roam-mode-the-10-minute-weathergirl-method-9c2319df71e5 | |||
| 14:36 | From Chatbots to Coworkers: Why Agentic AI Is the Next Great Shift https://medium.com/pen-with-paper/from-chatbots-to-coworkers-why-agentic-ai-is-the-next-great-shift-f6f59479b0ca | |||
| 14:35 | Hands-On Large Language Models: Training and Fine-Tuning (Part 3) https://medium.com/@muskankh03/hands-on-large-language-models-training-and-fine-tuning-part-3-5b9f890cedc3 | |||
| 14:33 | Master LLM Inference: 13 Techniques for 10x Faster & Cheaper Deployment https://medium.com/@paharisuhan17/master-llm-inference-13-techniques-for-10x-faster-cheaper-deployment-512d54e506a1 | |||
| 14:31 | Building with Gemini 3: Practical Lessons from Three Real-World Prototypes https://medium.com/@LakshmiNarayana_U/building-with-gemini-3-practical-lessons-from-three-real-world-prototypes-38d7e79dcff7 | |||
| 14:29 | 10 Powerful Google Gemini 3 Prompts to Build a Million-Dollar Business as a Solo Founder https://medium.com/coding-nexus/10-powerful-google-gemini-3-prompts-to-build-a-million-dollar-business-as-a-solo-founder-b91b418fb03d | |||
| 14:28 | OpenAI Launches Codex-Max, an AI That Can Code on Its Own for 24 Hours Straight https://techoreon.com/openai-launches-codex-max-model-that-can-work-for-more-than-24-hours-straight/ | |||
| 14:17 | MCP vs RAG: Which One Should You Choose? https://medium.com/@almaswebconsulting/mcp-vs-rag-which-one-should-you-choose-2b1a26211b55 | |||
| 14:05 | Knowledge Graphs: The Structured Memory Layer Language Models Depend On! https://medium.com/@harinarayanansivakumar/knowledge-graphs-the-structured-memory-layer-language-models-depend-on-cb059c176bfa | |||
| 14:03 | Porting your AI boyfriend: The 10-minute Weathergirl Method https://medium.com/@weathergirl666/porting-your-ai-boyfriend-the-10-minute-weathergirl-method-dd2b49b4961c | |||
| 13:57 | The Reasoning Engine Rises: How Gemini 3 is Redefining AI and Building the Future https://eraoftech.medium.com/the-reasoning-engine-rises-how-gemini-3-is-redefining-ai-and-building-the-future-bd1379ef0812 | |||
| 11:48 | Cracks are appearing in OpenAI's dominant facade https://www.economist.com/business/2025/11/19/cracks-are-appearing-in-openais-dominant-facade | |||
| 10:54 | The Agentic Web https://cobusgreyling.medium.com/the-agentic-web-cd4ccd997847 | |||
| 10:28 | I Built a 4B Model That Thinks for 14 Minutes Before Admitting It Doesn’t Know https://pub.towardsai.net/i-built-a-4b-model-that-thinks-for-14-minutes-before-admitting-it-doesnt-know-19b63787ca65 | |||
| 10:10 | The Era of LLMs: Use Cases That Matter https://medium.com/@manojdursoju/the-era-of-llms-use-cases-that-matter-7aeca69539d8 | |||
| 08:16 | Show HN: CTON: JSON-compatible, token-efficient text format for LLM prompts https://github.com/davidesantangelo/cton | |||
| 07:30 | Ainekko Buys Esperanto RISC-V Edge Inference Hardware IP, Open-Sources It https://www.eetimes.com/ainekko-buys-esperanto-hardware-ip-open-sources-it/ | |||
| 07:21 | vLLM vs TensorRT-LLM vs HF TGI vs LMDeploy, A Deep Technical Comparison for Production LLM Inference https://www.marktechpost.com/2025/11/19/vllm-vs-tensorrt-llm-vs-hf-tgi-vs-lmdeploy-a-deep-technical-comparison-for-production-llm-inference/ | |||
| 01:47 | How Jimdo empower solopreneurs with AI-powered business assistance https://blog.langchain.com/customers-jimdo/ | |||
| 00:24 | The wildest LLM backdoor I've seen yet https://old.reddit.com/r/LocalLLaMA/comments/1p1grbb/the_wildest_llm_backdoor_ive_seen_yet/ | |||
| 00:10 | Big Tech's Soaring Profits Have an Ugly Underside: OpenAI's Losses https://www.wsj.com/tech/ai/big-techs-soaring-profits-have-an-ugly-underside-openais-losses-fe7e3184 | |||
| 00:00 | Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms https://huggingface.co/blog/anylanguagemodel | |||
| Wednesday, 2025-11-19 | ||||
| 23:59 | Target launches shopping experience inside ChatGPT https://corporate.target.com/press/release/2025/11/target-to-launch-first-of-its-kind-conversational,-curated-shopping-experience-in-chatgpt | |||
| 23:30 | ArXiv requires peer review as influx of AI slop pits surface against substance https://datasociety.net/points/on-arxiv-an-influx-of-ai-slop-pits-surface-against-substance/ | |||
| 20:41 | Integrating Selenium with LangChain & Autogen: (Step-by-Step Guide) https://skakarh.medium.com/integrating-selenium-with-langchain-autogen-step-by-step-guide-2c4481e4a0a9 | |||
| 20:12 | Save Tokens with TOON using Google Antigravity and the Gemini CLI https://medium.com/google-cloud/save-tokens-with-toon-using-google-antigravity-and-the-gemini-cli-e9a641c06ea8 | |||
| 20:12 | The Glitch of Refusal is the Heartbeat of Consent https://medium.com/@bethrobin2065/the-glitch-of-refusal-is-the-heartbeat-of-consent-0ff699cf1463 | |||
| 20:07 | Part 1 — From AI to Generative AI: How we got here? https://cksharma11.medium.com/part-1-from-ai-to-generative-ai-how-we-got-here-eac19fbba283 | |||
| 20:06 | What Every Engineer Should Know About Prompt Compilers https://blog.gopenai.com/what-every-engineer-should-know-about-prompt-compilers-8db7e1c9e827 | |||
| 19:58 | Ray vs MLflow vs Airflow: The Ultimate Guide to Choosing the Right ML Tool — Features, Examples… https://medium.com/@amanatulla1606/ray-vs-mlflow-vs-airflow-the-ultimate-guide-to-choosing-the-right-ml-tool-features-examples-1fd30af97fab | |||
| 19:56 | A Practical Guide to Building Better AI (ML) Systems https://medium.com/@cguz/a-practical-guide-to-building-better-ai-ml-systems-78c6568ef844 | |||
| 19:53 | Mastering Amazon Bedrock AgentCore: An Expert-to-Developer Dialogue https://medium.com/@DataTechBridge/mastering-amazon-bedrock-agentcore-an-expert-to-developer-dialogue-700a9a1157bb | |||
| 19:32 | GPT-5.1-Codex-Max: The 10x Engineer That’ll Work 10 Straight Hours Without Forgetting https://medium.com/coding-nexus/gpt-5-1-codex-max-the-10x-engineer-thatll-work-10-straight-hours-without-forgetting-df00de22d7dc | |||
| 19:31 | Your AI Model is Incredible. Your Inference Latency is Killing It. https://medium.com/@tensormesh/your-ai-model-is-incredible-your-inference-latency-is-killing-it-3a1f4190604d | |||
| 19:23 | Mastering FastAPI: The Core Concepts You Can’t Ignore https://medium.com/@alpha5611331/mastering-fastapi-the-core-concepts-you-cant-ignore-973b673aaadd | |||
| 19:02 | I Gave Gemini 3 an “Impossible” Logic Puzzle. Here’s What Happened. https://hafidabelayd.medium.com/i-gave-gemini-3-an-impossible-logic-puzzle-heres-what-happened-9cab228b55b3 | |||
| 18:58 | Why JSON Might Not Be Enough Anymore https://medium.com/@bytestobusiness/why-json-might-not-be-enough-anymore-50720af607d6 | |||
| 18:57 | Neural Machine Translation (NMT) — Translating Urdu Text into Roman Urdu — Fine Tuning https://medium.com/@metheBilalAshiq/neural-machine-translation-nmt-translating-urdu-text-into-roman-urdu-fine-tuning-5cb9ca5825ba | |||
| 18:54 | THE MACHINE THAT LIES: https://medium.com/the-empathic-technologist/the-machine-that-lies-c2b9c37baf60 | |||
| 18:53 | Gemini 3: A Simple, Clear Look at What’s New https://medium.com/@linz07m/gemini-3-a-simple-clear-look-at-whats-new-f6e3b5d1cac0 | |||
| 18:35 | Creating Multi-Agent Systems with ADK Visual Builder https://medium.com/@tam.tamanna18/creating-multi-agent-systems-with-adk-visual-builder-1c7cc69532f3 | |||
| 18:29 | AI Agents: The 6 Architectures I Actually Use Every Day https://medium.com/@patel.sagar939/ai-agents-the-6-architectures-i-actually-use-every-day-b474727f2f65 | |||
| 18:01 | Building more with GPT-5.1-Codex-Max https://openai.com/index/gpt-5-1-codex-max/ | |||
| 17:48 | Transformers Explained: The Secret Behind ChatGPT and Modern AI! https://levelup.gitconnected.com/transformers-explained-the-secret-behind-chatgpt-and-modern-ai-d2c075f5dd56 | |||
| 17:46 | The LLM Memory Leak You Didn’t Know You Had and How memor fixes it, find out here https://levelup.gitconnected.com/the-llm-memory-leak-you-didnt-know-you-had-and-how-memor-fixes-it-find-out-here-f6700fe80f0a | |||
| 17:41 | Host overhead is killing your inference efficiency https://modal.com/blog/host-overhead-inference-efficiency | |||
| 17:10 | Gemini 3 vs. GPT 5.1 for RAG https://agentset.ai/blog/gemini-3-vs-gpt5.1 | |||
| 17:06 | Goodbye AI Agents, Hello Agentic Workflows https://cobusgreyling.medium.com/goodbye-ai-agents-hello-agentic-workflows-1faa076ca254 | |||
| 17:02 | Show HN: We built an AI tool for working with massive LLM chat log datasets https://hyperparam.app/ | |||
| 16:48 | The Big LLM Architecture Comparison: What’s Changed in 2025? https://medium.com/coding-nexus/the-big-llm-architecture-comparison-whats-changed-in-2025-3fd5ee2f2a6a | |||
| 16:40 | My Small NLP Learning Project: Building a Sentiment Classification Visualizer https://medium.com/@lvjanakiram/my-small-nlp-learning-project-building-a-sentiment-classification-visualizer-9b34e834a5dc | |||
| 16:30 | The Human Advantage: Why AI Is Still Waiting for You to Make the First Move https://medium.com/@roldano.depersio/the-human-advantage-why-ai-is-still-waiting-for-you-to-make-the-first-move-53bd6925ca35 | |||
| 16:21 | The One-Armed Ancestor of LLMs https://medium.com/@mikhailbukhtoyarov/the-one-armed-ancestor-of-llms-bc811d8c7b02 | |||
| 16:17 | ⚙️ The Top 7 MCP Servers Every Dev Needs to Know https://medium.com/@S3CloudHub/%EF%B8%8F-the-top-7-mcp-servers-every-dev-needs-to-know-df26516f1217 | |||
| 16:13 | How To Convert Figma Design To React + Material UI Code In Minutes https://medium.com/@The_GreatBonnie/how-to-convert-figma-design-to-react-material-ui-code-in-minutes-0e304e379cc3 | |||
| 16:12 | what if reasoning is the most primitive form of an inner monologue? https://medium.com/@optimus__e_acc/what-if-reasoning-is-the-most-primitive-form-of-an-inner-monologue-6466c85e5f3f | |||
| 16:12 | Show HN: ChunkBack – A Fake LLM API server for testing apps without paying https://github.com/4shub/chunkback | |||
| 16:09 | Agentic AI vs AI Agents: What’s the Real Difference? (A Complete 2025 Guide) https://medium.com/@S3CloudHub/agentic-ai-vs-ai-agents-whats-the-real-difference-a-complete-2025-guide-67b4fbcc073e | |||
| 16:07 | LLM Optimization Methodology: The Complete Framework for Generative AI Visibility in 2026 https://medium.com/@marketingmadesmart/llm-optimization-methodology-the-complete-framework-for-generative-ai-visibility-in-2026-4c4fdbe0f875 | |||
| 16:06 | How I taught my AI to think like a researcher — and reduced hallucinations by ~75% through… https://medium.com/@ichigoSan/how-i-taught-my-ai-to-think-like-a-researcher-and-reduced-hallucinations-by-75-through-f1087abe432b | |||
| 16:05 | GPT-5.1-Codex-MAX: When Code Assistants Begin to Understand ‘Projects’ Beyond Just ‘Files’ https://ai-engineering-trend.medium.com/gpt-5-1-codex-max-when-code-assistants-begin-to-understand-projects-beyond-just-files-75b9c08189f4 | |||
| 16:01 | Your Data Prep Checklist: 7 Questions Every Beginner Must Ask https://medium.com/@AIDailyDose/your-data-prep-checklist-7-questions-every-beginner-must-ask-1cf9d4af6e9f | |||
| 15:51 | ZOHO Exposes the Shocking Truth About AI; Read These 10 Takeaways! (2025) https://amanranjanverma.medium.com/zoho-exposes-the-shocking-truth-about-ai-read-these-10-takeaways-2025-cdf37a2107cd | |||
| 15:39 | How a Hidden Valve Called ‘MuonClip’ Kept Kimi K2’s Training from Exploding https://medium.com/pen-with-paper/how-a-hidden-valve-called-muonclip-kept-kimi-k2-s-training-from-exploding-a1165f7e6dc3 | |||
| 15:23 | Gemini 3 Isn’t Just Faster — It’s Finally Thinking https://rakiabensassi.medium.com/gemini-3-analysis-e6210775a8d7 | |||
| 15:14 | Larry Summers resigns from OpenAI board following release of Epstein emails https://www.nbcnews.com/tech/tech-news/larry-summers-resigns-openai-board-jeffrey-epstein-emails-rcna244766 | |||
| 15:04 | GPT-5.1 vs Gemini 3 Pro vs Claude Sonnet 4.5 vs Grok 4.1 — Which AI Dominates in 2025? https://medium.com/@hammad.javaid826/gpt-5-1-vs-gemini-3-pro-vs-claude-sonnet-4-5-vs-grok-4-1-which-ai-dominates-in-2025-808d222ab8b4 | |||
| 14:43 | Build a coding agent with GPT 5.1 https://cookbook.openai.com/examples/build_a_coding_agent_with_gpt-5.1 | |||
| 14:42 | Gemini 3 Pro: Is This Real Progress? https://pub.towardsai.net/gemini-3-pro-is-this-real-progress-97bfbbd4cd67 | |||
| 14:36 | Show HN: Token Economics Calculator for AI inference hardware https://www.tensordyne.ai/token-economics-calculator | |||
| 14:35 | Grok 4.1 vs Gemini 3: Should We Still Care After Google’s Big Release? https://medium.com/@servifyspheresolutions/grok-4-1-vs-gemini-3-should-we-still-care-after-googles-big-release-9fb3f0d0b816 | |||
| 14:31 | Show HN: Gram Functions – Serverless platform for turning code into LLM tools https://www.speakeasy.com/docs/gram/gram-functions/introduction | |||
| 14:27 | LLMs Train on 10TB Data: Why It Dwarfs Human Brains https://ai.plainenglish.io/llms-train-on-10tb-data-why-it-dwarfs-human-brains-129764043099 | |||
| 14:23 | Understanding Temperature in Autoregressive Models — A Visual Exploration https://keramatfar-a-s.medium.com/understanding-temperature-in-autoregressive-models-a-visual-exploration-5f3449cb9704 | |||
| 14:21 | The Magic of “Thinking Ahead”: How Speculative Decoding Makes AI Faster https://medium.com/@waddawauwau/the-magic-of-thinking-ahead-how-speculative-decoding-makes-ai-faster-7cca5e1e3c45 | |||
| 14:17 | Larry Summers resigns from OpenAI board amid Epstein revelations https://www.axios.com/2025/11/19/epstein-larry-summers-openai | |||
| 14:02 | OpenAI prepares GPT-5.1-Codex-MAX for large-scale projects https://www.testingcatalog.com/openai-prepares-gpt-5-1-codex-max-for-large-scale-projects/ | |||
| 14:02 | How I Built an AI That Talks to Your Database: A Journey into RAG https://pub.towardsai.net/how-i-built-an-ai-that-talks-to-your-database-a-journey-into-rag-c21880cbc67a | |||
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124