<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Llm on Neat Guy Coding</title><link>https://neatguycoding.com/tags/llm/</link><description>Recent content in Llm on Neat Guy Coding</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 NeatGuyCoding</copyright><lastBuildDate>Mon, 18 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://neatguycoding.com/tags/llm/index.xml" rel="self" type="application/rss+xml"/><item><title>Agentic Topic Modeling: Embedding Pipelines, LLMs, and Human-in-the-Loop Engineering Trade-offs</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-topic-modeling-with-maarten-grootendorst-weaviate-podcast-126/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-topic-modeling-with-maarten-grootendorst-weaviate-podcast-126/</guid><description>Agentic topic modeling: modular embedding pipelines, LLM-maintained topic tables, and human-in-the-loop granularity—engineering trade-offs between BERTopic, TopicGPT, and retrieval-scale deployment.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-topic-modeling-with-maarten-grootendorst-weaviate-podcast-126/cover.png"/></item><item><title>Compound AI: When a Single LLM Call Is Not Enough</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-compound-ai-systems-with-philip-kiely-weaviate-podcast-105/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-compound-ai-systems-with-philip-kiely-weaviate-podcast-105/</guid><description>Compound AI: When a single LLM call is not enough—multiple model calls, retrievers, tools, and business logic as a graph; structured output, specialist pipelines, inference stacks, and deployment granularity from a Weaviate podcast with Baseten&amp;rsquo;s Philip Kiely.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-compound-ai-systems-with-philip-kiely-weaviate-podcast-105/cover.png"/></item><item><title>Judge-Time Compute: When LLM Evaluation Moves from a Single Score to a Composable Pipeline</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-haize-labs-with-leonard-tang-weaviate-podcast-121/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-haize-labs-with-leonard-tang-weaviate-podcast-121/</guid><description>Judge-time compute: stacking structured, composable weak-model calls at evaluation time instead of assuming one expensive judge pass is enough—Verdict, agreement metrics, and production guardrails, with evidence boundaries called out.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-haize-labs-with-leonard-tang-weaviate-podcast-121/cover.png"/></item><item><title>Semantic Query Engines: When LLM Operators Enter the Query Optimizer</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-semantic-query-engines-with-matthew-russo-weaviate-podcast-131/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-semantic-query-engines-with-matthew-russo-weaviate-podcast-131/</guid><description>Semantic query engines treat foundation-model filter, join, classify, map, and rank as first-class operators—logical and physical plans, cost–quality tradeoffs, SemBench workloads, and how they differ from script-style RAG and vector search alone.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-semantic-query-engines-with-matthew-russo-weaviate-podcast-131/cover.png"/></item><item><title>When Format Constraints Hurt LLMs: A Split Between Agent Pipelines and Benchmark Evaluation</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-let-me-speak-freely-with-zhi-rui-tam-weaviate-podcast-108/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-let-me-speak-freely-with-zhi-rui-tam-weaviate-podcast-108/</guid><description>When format constraints hurt LLMs: the same structured-output techniques often lower scores on reasoning tasks and raise them on discrete classification—from agent pipelines to benchmark evaluation.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-let-me-speak-freely-with-zhi-rui-tam-weaviate-podcast-108/cover.png"/></item></channel></rss>