<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Weaviate-Podcast on Neat Guy Coding</title><link>https://neatguycoding.com/tags/weaviate-podcast/</link><description>Recent content in Weaviate-Podcast on Neat Guy Coding</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 NeatGuyCoding</copyright><lastBuildDate>Mon, 18 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://neatguycoding.com/tags/weaviate-podcast/index.xml" rel="self" type="application/rss+xml"/><item><title>Agent Oversight Stack: From Static Evaluation to Trajectory-Level Observability</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-patronus-ai-with-anand-kannappan-weaviate-podcast-122/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-patronus-ai-with-anand-kannappan-weaviate-podcast-122/</guid><description>Agent oversight stack: from static evaluation to trajectory-level observability—evaluation, observability, and supervision for multi-agent systems, with Percival, Lynx, and Glider, and evidence boundaries called out.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-patronus-ai-with-anand-kannappan-weaviate-podcast-122/cover.png"/></item><item><title>Agentic RAG: When Retrieval Pipelines Grow a Planning-and-Tools Loop</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-rag-with-erika-cardenas-weaviate-podcast-109/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-rag-with-erika-cardenas-weaviate-podcast-109/</guid><description>Agentic RAG: When retrieval pipelines add LLM plan–act–observe loops, tool calling, and multi-step validation—separating verified docs from interview speculation for production teams.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-rag-with-erika-cardenas-weaviate-podcast-109/cover.png"/></item><item><title>Agentic Topic Modeling: Embedding Pipelines, LLMs, and Human-in-the-Loop Engineering Trade-offs</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-topic-modeling-with-maarten-grootendorst-weaviate-podcast-126/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-topic-modeling-with-maarten-grootendorst-weaviate-podcast-126/</guid><description>Agentic topic modeling: modular embedding pipelines, LLM-maintained topic tables, and human-in-the-loop granularity—engineering trade-offs between BERTopic, TopicGPT, and retrieval-scale deployment.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-agentic-topic-modeling-with-maarten-grootendorst-weaviate-podcast-126/cover.png"/></item><item><title>Agents on Semi-Structured Retrieval: STaRK Benchmark and AvaTaR Optimization</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-optimizing-retrieval-agents-with-shirley-wu-weaviate-podcast-115/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-optimizing-retrieval-agents-with-shirley-wu-weaviate-podcast-115/</guid><description>Stanford&amp;rsquo;s STaRK benchmark and AvaTaR contrastive optimization for retrieval agents on semi-structured knowledge bases—metrics, multi-vector limits, when agents lose to dense retrievers, and what to ship in production.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-optimizing-retrieval-agents-with-shirley-wu-weaviate-podcast-115/cover.png"/></item><item><title>AI-Powered Search: When RAG, Agents, and Classic IR Get Rewired</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-doug-turnbull-and-trey-grainger-on-ai-powered-search-weaviate-podcast-13/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-doug-turnbull-and-trey-grainger-on-ai-powered-search-weaviate-podcast-13/</guid><description>AI-Powered Search: When RAG, agents, and classic IR get rewired—retrieval quality vs. agent loops, long context vs. searchable history, leaderboard embeddings vs. domain corpora, with Doug Turnbull and Trey Grainger on what ships.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-doug-turnbull-and-trey-grainger-on-ai-powered-search-weaviate-podcast-13/cover.png"/></item><item><title>Architectural Tension in the Voice-Agent Era: SSMs, Low-Latency TTS, and Whether End-to-End Eats the Orchestration Stack</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-cartesia-ai-with-karan-goel-weaviate-podcast-113/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-cartesia-ai-with-karan-goel-weaviate-podcast-113/</guid><description>Architectural tension in the voice-agent era: SSMs, low-latency TTS, and whether end-to-end models will displace compound orchestration chains.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-cartesia-ai-with-karan-goel-weaviate-podcast-113/cover.png"/></item><item><title>Compound AI: When a Single LLM Call Is Not Enough</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-compound-ai-systems-with-philip-kiely-weaviate-podcast-105/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-compound-ai-systems-with-philip-kiely-weaviate-podcast-105/</guid><description>Compound AI: When a single LLM call is not enough—multiple model calls, retrievers, tools, and business logic as a graph; structured output, specialist pipelines, inference stacks, and deployment granularity from a Weaviate podcast with Baseten&amp;rsquo;s Philip Kiely.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-compound-ai-systems-with-philip-kiely-weaviate-podcast-105/cover.png"/></item><item><title>Data Agents: When Code-Writing Models Meet the Real Data Stack</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-data-agents-with-shreya-shankar-weaviate-podcast-135/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-data-agents-with-shreya-shankar-weaviate-podcast-135/</guid><description>Data agents across Snowflake, MySQL, Mongo, and Salesforce—DAB benchmarks, DocETL, tribal knowledge, and agent-first databases, with verifiable claims separated from speaker opinion.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-data-agents-with-shreya-shankar-weaviate-podcast-135/cover.png"/></item><item><title>Engineering Trade-offs in Retrieval Embeddings: Leaderboards, Training, and Production Constraints via Arctic Embed</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-arctic-embed-with-luke-merrick-puxuan-yu-and-charles-pierse-weaviate-pod/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-arctic-embed-with-luke-merrick-puxuan-yu-and-charles-pierse-weaviate-pod/</guid><description>Engineering trade-offs in retrieval embeddings: how to read leaderboards, what contrastive pre-training and fine-tuning each solve, how Matryoshka representation learning scales to billion-vector indexes, and the gap between multilingual benchmarks and proprietary distributions—grounded in Snowflake Arctic Embed and the Weaviate podcast.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-arctic-embed-with-luke-merrick-puxuan-yu-and-charles-pierse-weaviate-pod/cover.png"/></item><item><title>Enterprise AI on Exabyte-Scale Unstructured Content: Permissions, Layered Retrieval, and Agent Boundaries</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-box-ai-with-ben-kus-and-bob-van-luijt-weaviate-podcast-120/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-box-ai-with-ben-kus-and-bob-van-luijt-weaviate-podcast-120/</guid><description>Enterprise AI on exabyte-scale unstructured content: permissions, layered retrieval, and agent boundaries—engineering lessons from Box × Weaviate on ACL-aware RAG, embedding economics, and production agents.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-box-ai-with-ben-kus-and-bob-van-luijt-weaviate-podcast-120/cover.png"/></item><item><title>Enterprise RAG and Agents: From Frankenstein Pipelines to an Optimizable Whole System</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-contextual-ai-with-amanpreet-singh-weaviate-podcast-114/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-contextual-ai-with-amanpreet-singh-weaviate-podcast-114/</guid><description>Enterprise RAG and agents: from stitched-together pipelines to an end-to-end optimizable system—RAG 2.0, active retrieval, preference learning (KTO/APO), and LMUnit-style evaluation, with evidence boundaries called out.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-contextual-ai-with-amanpreet-singh-weaviate-podcast-114/cover.png"/></item><item><title>Enterprise RAG and Agents: When Vector Databases Meet Four Decades of Analytics Software</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-saurabh-mishra-and-bob-van-luijt-on-weaviate-and-sas-weaviate-podcast-12/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-saurabh-mishra-and-bob-van-luijt-on-weaviate-and-sas-weaviate-podcast-12/</guid><description>Enterprise RAG and agents when vector databases meet four decades of analytics software—engineering tensions in regulated industries, SAS RAM, Weaviate integration, and production boundaries.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-saurabh-mishra-and-bob-van-luijt-on-weaviate-and-sas-weaviate-podcast-12/cover.png"/></item><item><title>Enterprise RAG on Financial Research Corpora: Engineering Trade-offs in Vector Stores, Agents, and Eval</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-morningstar-intelligence-engine-with-aravind-kesiraju-weaviate-podcast-1/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-morningstar-intelligence-engine-with-aravind-kesiraju-weaviate-podcast-1/</guid><description>Enterprise RAG on financial research corpora: engineering trade-offs across vector stores, agents, and eval—ingestion throughput, retrieval granularity, entitlements, and agent latency.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-morningstar-intelligence-engine-with-aravind-kesiraju-weaviate-podcast-1/cover.png"/></item><item><title>From RAG to Search Agents: Three Tensions in Retrieval, Synthetic Data, and Evaluation</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-search-agents-with-nandan-thakur-weaviate-podcast-137/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-search-agents-with-nandan-thakur-weaviate-podcast-137/</guid><description>From RAG to search agents: BEIR co-author Nandan Thakur on BrowseComp-Plus, synthetic data pipelines, GRPO economics, and why retrieval benchmarks, training cost, and harness design pull in different directions.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-search-agents-with-nandan-thakur-weaviate-podcast-137/cover.png"/></item><item><title>Judge-Time Compute: When LLM Evaluation Moves from a Single Score to a Composable Pipeline</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-haize-labs-with-leonard-tang-weaviate-podcast-121/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-haize-labs-with-leonard-tang-weaviate-podcast-121/</guid><description>Judge-time compute: stacking structured, composable weak-model calls at evaluation time instead of assuming one expensive judge pass is enough—Verdict, agreement metrics, and production guardrails, with evidence boundaries called out.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-haize-labs-with-leonard-tang-weaviate-podcast-121/cover.png"/></item><item><title>Multi-Stage Language Programs and Automatic Prompt Optimization: From DSPy to MIPRO</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-mipro-and-dspy-with-krista-opsahl-ong-weaviate-podcast-103/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-mipro-and-dspy-with-krista-opsahl-ong-weaviate-podcast-103/</guid><description>Multi-stage language programs and automatic prompt optimization: from DSPy to MIPRO—proposal, bootstrapping, and combinatorial search; credit assignment; meta-proposers; and how they relate to RAG, agents, and fine-tuning.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-mipro-and-dspy-with-krista-opsahl-ong-weaviate-podcast-103/cover.png"/></item><item><title>Multi-Vector Search: Choosing Among Single-Vector, Late Interaction, and Cascaded Reranking</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-multi-vector-search-with-ame-lie-chatelain-and-antoine-chaffin-weaviate/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-multi-vector-search-with-ame-lie-chatelain-and-antoine-chaffin-weaviate/</guid><description>Multi-vector search: how to choose among single-vector bi-encoders, late interaction (ColBERT-family), and cascaded reranking—grounded in the Weaviate podcast with LightOn&amp;rsquo;s Amélie Chatelain and Antoine Chaffin.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-multi-vector-search-with-ame-lie-chatelain-and-antoine-chaffin-weaviate/cover.png"/></item><item><title>Query Agent on a Vector Database: Auditable Retrieval and Two Ways to Ask Your Data</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-weaviate-s-query-agent-with-charles-pierse-weaviate-podcast-128/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-weaviate-s-query-agent-with-charles-pierse-weaviate-podcast-128/</guid><description>Query Agent on a vector database: auditable retrieval, Ask vs Search modes, schema introspection, multi-collection routing, and what is verified in docs versus speaker claims.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-weaviate-s-query-agent-with-charles-pierse-weaviate-podcast-128/cover.png"/></item><item><title>REFRAG: Turning RAG Context from a Token String into a Compressible Representation</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-refrag-with-xiaoqiang-lin-weaviate-podcast-130/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-refrag-with-xiaoqiang-lin-weaviate-podcast-130/</guid><description>REFRAG compresses retrieved passages into chunk-level decoder positions, then uses RL to selectively expand high-entropy spans—mechanisms, training pipeline, and how to read TTFT and RAG benchmarks without over-generalizing paper numbers.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-refrag-with-xiaoqiang-lin-weaviate-podcast-130/cover.png"/></item><item><title>Retrieval List Diversification: Geometric Post-Processing, Evaluation Gaps, and RAG Context Budgets</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-pyversity-with-thomas-van-dongen-weaviate-podcast-132/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-pyversity-with-thomas-van-dongen-weaviate-podcast-132/</guid><description>Retrieval list diversification: geometric post-processing, evaluation gaps, and RAG context budgets—MMR, MSD, DPP, Cover, and SSD as NumPy reranking after any Python retrieval stack.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-pyversity-with-thomas-van-dongen-weaviate-podcast-132/cover.png"/></item><item><title>Scaling DataFrames: When Notebook Habits Meet Distributed Execution</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-scaling-pandas-with-devin-petersohn-weaviate-podcast-101/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-scaling-pandas-with-devin-petersohn-weaviate-podcast-101/</guid><description>Scaling DataFrames: when notebook habits meet distributed execution—pandas semantics, Modin&amp;rsquo;s compiler stack, Snowflake ordering, Parquet pushdown, quote-aware CSV, Ray data movement, and what is verified vs. speaker opinion.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-scaling-pandas-with-devin-petersohn-weaviate-podcast-101/cover.png"/></item><item><title>Semantic Query Engines: When LLM Operators Enter the Query Optimizer</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-semantic-query-engines-with-matthew-russo-weaviate-podcast-131/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-semantic-query-engines-with-matthew-russo-weaviate-podcast-131/</guid><description>Semantic query engines treat foundation-model filter, join, classify, map, and rank as first-class operators—logical and physical plans, cost–quality tradeoffs, SemBench workloads, and how they differ from script-style RAG and vector search alone.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-semantic-query-engines-with-matthew-russo-weaviate-podcast-131/cover.png"/></item><item><title>Software Engineering Agents on Real Repositories: SWE-Bench and the Debate Over Evaluation Scaffolding</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-swe-bench-with-john-yang-and-carlos-e-jimenez-weaviate-podcast-107/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-swe-bench-with-john-yang-and-carlos-e-jimenez-weaviate-podcast-107/</guid><description>Software engineering agents on real repositories: SWE-Bench benchmarks GitHub issue → patch → tests green, while SWE-agent pushes the debate onto Agent-Computer Interface design—separating verified docs from speaker opinion.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-swe-bench-with-john-yang-and-carlos-e-jimenez-weaviate-podcast-107/cover.png"/></item><item><title>Stateful Agents and Context Compilation: The Engineering Divide from MemGPT to Letta</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-letta-ai-with-sarah-wooders-weaviate-podcast-117/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-letta-ai-with-sarah-wooders-weaviate-podcast-117/</guid><description>Stateful agents and context compilation: how Letta (from MemGPT) treats the context window as a compiled runtime view—memory tiers, agentic RAG, tool-call unification, multi-agent blocks, and observability—with evidence boundaries called out.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-letta-ai-with-sarah-wooders-weaviate-podcast-117/cover.png"/></item><item><title>Structured Outputs: From Parseable JSON to Logit-Level Constrained Generation</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-structured-outputs-with-will-kurt-and-cameron-pfiffer-weaviate-podcast-1/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-structured-outputs-with-will-kurt-and-cameron-pfiffer-weaviate-podcast-1/</guid><description>Structured outputs: from parseable JSON to logit-level constrained generation—why RAG pipelines and agents need generation-time constraints, how FSMs and coalescence work, and how to choose between API guarantees and self-hosted logits masking.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-structured-outputs-with-will-kurt-and-cameron-pfiffer-weaviate-podcast-1/cover.png"/></item><item><title>Sufficient Context: RAG Should Measure Whether There's Enough to Answer, Not Just Whether Chunks Look Relevant</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-sufficient-context-with-hailey-joren-weaviate-podcast-125/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-sufficient-context-with-hailey-joren-weaviate-podcast-125/</guid><description>Sufficient context asks whether retrieved chunks let a model answer the question—not just whether they look relevant. A Weaviate Podcast #125 walkthrough of Joren et al. (ICLR 2025) on RAG evaluation, abstention, and selective generation.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-sufficient-context-with-hailey-joren-weaviate-podcast-125/cover.png"/></item><item><title>Synthetic Data: Boundaries of Data Fabrication in RAG, Agents, and Evaluation</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-synthetic-data-with-david-berenstein-and-ben-burtenshaw-weaviate-podcast/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-synthetic-data-with-david-berenstein-and-ben-burtenshaw-weaviate-podcast/</guid><description>Synthetic data for RAG, agents, and offline evaluation—when to augment, how to trust the distribution, and pipelines from distilabel and Persona Hub to Hub SQL and quality filters.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-synthetic-data-with-david-berenstein-and-ben-burtenshaw-weaviate-podcast/cover.png"/></item><item><title>The Boundaries of Enterprise RAG: Managed Pipelines, Vector Stores, and Write-Back Retrieval</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-vertex-ai-rag-engine-with-lewis-liu-and-bob-van-luijt-weaviate-podcast-1/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-vertex-ai-rag-engine-with-lewis-liu-and-bob-van-luijt-weaviate-podcast-1/</guid><description>The boundaries of enterprise RAG: managed pipelines, vector stores, and write-back retrieval—engineering lessons from Vertex AI RAG Engine × Weaviate on parsing leverage, multi-corpus routing, and generative feedback loops.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-vertex-ai-rag-engine-with-lewis-liu-and-bob-van-luijt-weaviate-podcast-1/cover.png"/></item><item><title>The Multi-Vector Retrieval Index Paradox: How MUVERA Approximates Chamfer with Single-Vector ANN</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-muvera-with-rajesh-jayaram-and-roberto-esposito-weaviate-podcast-123/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-muvera-with-rajesh-jayaram-and-roberto-esposito-weaviate-podcast-123/</guid><description>Models like ColBERT and ColPali represent documents as token-level vector sets and pay for finer alignment with late interaction (MaxSim/Chamfer)—but index entries explode from one per document to hundreds. Google Research&amp;rsquo;s MUVERA compresses each set into a single fixed-dimensional encoding for one ANN pass, then reranks with true Chamfer; this article separates paper facts from podcast opinion for engineers shipping multi-vector search.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-muvera-with-rajesh-jayaram-and-roberto-esposito-weaviate-podcast-123/cover.png"/></item><item><title>When Format Constraints Hurt LLMs: A Split Between Agent Pipelines and Benchmark Evaluation</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-let-me-speak-freely-with-zhi-rui-tam-weaviate-podcast-108/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-let-me-speak-freely-with-zhi-rui-tam-weaviate-podcast-108/</guid><description>When format constraints hurt LLMs: the same structured-output techniques often lower scores on reasoning tasks and raise them on discrete classification—from agent pipelines to benchmark evaluation.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-let-me-speak-freely-with-zhi-rui-tam-weaviate-podcast-108/cover.png"/></item><item><title>When Queries Become Whole Blocks of Code: The Split Between RAG Evaluation and Search-Style Benchmarks</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-rag-benchmarks-with-nandan-thakur-weaviate-podcast-124/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-rag-benchmarks-with-nandan-thakur-weaviate-podcast-124/</guid><description>Production RAG no longer matches short-query IR leaderboards—BEIR co-author Nandan Thakur on why search benchmarks and long-context, nugget-level RAG evaluation are diverging axes.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-rag-benchmarks-with-nandan-thakur-weaviate-podcast-124/cover.png"/></item><item><title>When Scalar Reward Isn't Enough: Reflective Text Evolution in GEPA and Compound AI</title><link>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-gepa-with-lakshya-a-agrawal-weaviate-podcast-127/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-gepa-with-lakshya-a-agrawal-weaviate-podcast-127/</guid><description>When scalar reward isn&amp;rsquo;t enough: GEPA&amp;rsquo;s reflective prompt evolution and per-instance Pareto retention for compound AI language programs—natural-language feedback, LangProBe benchmarks, and how it compares to GRPO and MIPROv2.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://neatguycoding.com/posts/2026-05-18-weaviate-podcast-gepa-with-lakshya-a-agrawal-weaviate-podcast-127/cover.png"/></item></channel></rss>