Podcast

Infosec Briefing: CA Support Desk Breach, Kernel Privilege Escalation, and ATT&CK Tactical Restructuring

18 May 2026·1401 words·7 mins

Infosec Briefing: CA Support Desk Breach, Kernel Privilege Escalation, and ATT&CK Tactical Restructuring

From RAG to Search Agents: Three Tensions in Retrieval, Synthetic Data, and Evaluation

18 May 2026·2047 words·10 mins

From RAG to search agents: BEIR co-author Nandan Thakur on BrowseComp-Plus, synthetic data pipelines, GRPO economics, and why retrieval benchmarks, training cost, and harness design pull in different directions.

Enterprise RAG on Financial Research Corpora: Engineering Trade-offs in Vector Stores, Agents, and Eval

18 May 2026·2023 words·10 mins

Enterprise RAG on financial research corpora: engineering trade-offs across vector stores, agents, and eval—ingestion throughput, retrieval granularity, entitlements, and agent latency.

Enterprise RAG and Agents: When Vector Databases Meet Four Decades of Analytics Software

18 May 2026·2323 words·11 mins

Enterprise RAG and agents when vector databases meet four decades of analytics software—engineering tensions in regulated industries, SAS RAM, Weaviate integration, and production boundaries.

Enterprise RAG and Agents: From Frankenstein Pipelines to an Optimizable Whole System

18 May 2026·2149 words·11 mins

Enterprise RAG and agents: from stitched-together pipelines to an end-to-end optimizable system—RAG 2.0, active retrieval, preference learning (KTO/APO), and LMUnit-style evaluation, with evidence boundaries called out.

Enterprise AI on Exabyte-Scale Unstructured Content: Permissions, Layered Retrieval, and Agent Boundaries

18 May 2026·2260 words·11 mins

Enterprise AI on exabyte-scale unstructured content: permissions, layered retrieval, and agent boundaries—engineering lessons from Box × Weaviate on ACL-aware RAG, embedding economics, and production agents.

Engineering Trade-offs in Retrieval Embeddings: Leaderboards, Training, and Production Constraints via Arctic Embed

18 May 2026·2236 words·11 mins

Engineering trade-offs in retrieval embeddings: how to read leaderboards, what contrastive pre-training and fine-tuning each solve, how Matryoshka representation learning scales to billion-vector indexes, and the gap between multilingual benchmarks and proprietary distributions—grounded in Snowflake Arctic Embed and the Weaviate podcast.

eCHO 201: 2026 Networking, eBPF, and Security Predictions — Technical Notes

18 May 2026·1198 words·6 mins

eCHO 201: 2026 networking, eBPF, and security predictions — technical notes with evidence boundaries for each claim.

Data Agents: When Code-Writing Models Meet the Real Data Stack

18 May 2026·2143 words·11 mins

Data agents across Snowflake, MySQL, Mongo, and Salesforce—DAB benchmarks, DocETL, tribal knowledge, and agent-first databases, with verifiable claims separated from speaker opinion.

Compound AI: When a Single LLM Call Is Not Enough

18 May 2026·2083 words·10 mins

Compound AI: When a single LLM call is not enough—multiple model calls, retrievers, tools, and business logic as a graph; structured output, specialist pipelines, inference stacks, and deployment granularity from a Weaviate podcast with Baseten’s Philip Kiely.

Cilium at Ten: Community Scale, Survey Signals, and the 1.19 Technical Thread

18 May 2026·1382 words·7 mins

Cilium at Ten: Community Scale, Survey Signals, and the 1.19 Technical Thread

Cilium 1.19: What to Verify Before You Upgrade

18 May 2026·1392 words·7 mins

Cilium 1.19: What to verify before you upgrade — an upgrade checklist from the eCHO walkthrough of the 1.19.0 release.

Architectural Tension in the Voice-Agent Era: SSMs, Low-Latency TTS, and Whether End-to-End Eats the Orchestration Stack

18 May 2026·1950 words·10 mins

Architectural tension in the voice-agent era: SSMs, low-latency TTS, and whether end-to-end models will displace compound orchestration chains.

AI-Powered Search: When RAG, Agents, and Classic IR Get Rewired

18 May 2026·2081 words·10 mins

AI-Powered Search: When RAG, agents, and classic IR get rewired—retrieval quality vs. agent loops, long context vs. searchable history, leaderboard embeddings vs. domain corpora, with Doug Turnbull and Trey Grainger on what ships.

Agents on Semi-Structured Retrieval: STaRK Benchmark and AvaTaR Optimization

18 May 2026·2209 words·11 mins

Stanford’s STaRK benchmark and AvaTaR contrastive optimization for retrieval agents on semi-structured knowledge bases—metrics, multi-vector limits, when agents lose to dense retrievers, and what to ship in production.

Agentic Topic Modeling: Embedding Pipelines, LLMs, and Human-in-the-Loop Engineering Trade-offs

18 May 2026·2106 words·10 mins

Agentic topic modeling: modular embedding pipelines, LLM-maintained topic tables, and human-in-the-loop granularity—engineering trade-offs between BERTopic, TopicGPT, and retrieval-scale deployment.

Agentic RAG: When Retrieval Pipelines Grow a Planning-and-Tools Loop

18 May 2026·2207 words·11 mins

Agentic RAG: When retrieval pipelines add LLM plan–act–observe loops, tool calling, and multi-step validation—separating verified docs from interview speculation for production teams.

Agent Oversight Stack: From Static Evaluation to Trajectory-Level Observability

18 May 2026·2020 words·10 mins

Agent oversight stack: from static evaluation to trajectory-level observability—evaluation, observability, and supervision for multi-agent systems, with Percival, Lynx, and Glider, and evidence boundaries called out.

↑