
Prithwish Nath
Articles by Prithwish Nath (18)
Reducing MCP Response Sizes for LLM Context Limits
How to measure per-tool MCP token cost, cut the MCP menu tax, and enforce a hard budget on any MCP server — with under 2% added latency.
How To Build Faster Data Pipelines in Python with Apache Arrow.
How to build web data pipelines 2.6x faster and 60% cheaper with Apache Arrow. Cut memory usage and file sizes by half for AI training.
Building a Fault-Tolerant Web Scraping Pipeline with Effect-TS
Effect-TS changed how I think about web data ingestion in TypeScript. Here's a practical guide for building pipelines that handle failure reliably.
From Google Search to Kafka Event Stream for Many Consumers
How to model SERP changes as Kafka event streams and fan them out to independent consumers.
How Far Can DuckDB Go Without a Cloud Warehouse?
No server. No cloud bill. Just DuckDB and 50M rows on a cheap $500 Acer laptop. Here's the honest truth on where local analytics work best.
Build Your Own, Self-Hosted Google Trends Alternative with DuckDB
Step-by-step guide for a Google SERP tracker with Python, Bright Data, and DuckDB. Free alternative to Semrush and Ahrefs.
Building a Bun CLI That Turns API Docs Pages Into TypeScript Clients
Build a Bun CLI that scrapes API documentation pages and generates fully typed TypeScript clients—no OpenAPI schema required.
How to Validate API Responses in Python with Great Expectations (GX Core)
How to validate any API response in Python, quarantine bad batches, and store only clean results in DuckDB.
The Case For Adding Observability to Data Extraction with OpenTelemetry
How adding OpenTelemetry and Jaeger exposes hidden retries, latency, and silent failures in production.
Fully Local Data Analytics with dbt Core and DuckDB
How to use dbt’s layered SQL models and DuckDB’s embedded engine to clean, normalize, and transform raw web data into analyst-ready tables.
Also on the network
In Plain English
How to Reduce MCP Response Sizes for LLM Context Limits
Jun 25, 2026
5 MCP Servers That Can Replace Expensive SaaS Subscriptions in 2026
Jun 16, 2026
Open-Source Devs Need To Ship Distribution, Not Just Code.
Jun 12, 2026
5 Production Stacks for Live Data Ingestion at Scale (Without Getting Blocked)
Jun 1, 2026
Hurl vs Postman: Git-Friendly API Testing With Proxy-Aware Egress (2026)
Jun 1, 2026
A Practical Guide To Entity Resolution in Python (No Database, No Machine Learning)
Jun 1, 2026
Turning Google into an Explorable Knowledge Graph Using Pure k-NN
May 12, 2026
How Failing at Fantasy Baseball Made Me Fix My Cron Jobs with Temporal
May 5, 2026