Optimizing Large-Scale Excel Downloads in Spring Boot

A Spring Boot library for streaming 1M-row Excel downloads without running out of memory (OOM). A single annotation eliminates the boilerplate.

April 1, 2026 · 2 min · Junho Lee

The Unexpected Walls When Converting Web Pages with defuddle

I tried extracting web data for a RAG pipeline with defuddle and found that results vary widely with site structure. On sites whose semantic HTML has broken down, body text and ads get mixed together, and on dynamically rendered pages the content itself disappears.

March 31, 2026 · 2 min · Junho Lee