When team documents are scattered across Notion, GitHub Issues, and S3 buckets, building connectors for each source becomes the bottleneck. OpenDocuments is an open-source RAG platform that has that wiring pre-built.

Over 12 Connectors

It connects to Notion, GitHub, S3, PDFs, and Jupyter notebooks. Using Ollama, data stays entirely within the server with no external API calls. This architecture fits environments with strict network isolation like financial institutions or retail companies.

Built on SQLite and LanceDB, installation finishes with a single Docker command. Without heavy infrastructure, spinning it up as a side project or internal team tool feels lightweight. Once configured, embedding and storage happen automatically when files come in. No manual classification work needed from the admin.

Hybrid Search Is the Default

It comes with hybrid search combining vector and keyword approaches, plus reranking. Even mixed Korean-English queries pick up context and cite sources. Displaying the basis documents for each answer reduces hallucination concerns.

Switching from local to a high-performance external API is a one-line config change. It supports MCP servers so you can plug it directly into coding agents, and the plugin architecture is open for adding custom parsers and connectors.


Key Takeaways

  • Ollama integration enables on-premise RAG without external APIs, suitable for network-isolated environments
  • Hybrid search with reranking returns accurate sources even for Korean-English cross-language queries
  • MCP server support lets you connect coding agents directly to internal knowledge bases

Source: https://news.hada.io/topic?id=27910