Octen AI, an AI-native search infrastructure company, has secured $10.0 million in a recent funding round. Founded in 2025 and headquartered in California and Singapore, the company is dedicated to building the ultimate retrieval layer for the Large Language Model (LLM) era, providing essential search infrastructure services to developers and enterprises globally.
The company offers an end-to-end Web Search API specifically architected for AI agents. This technology bridges the gap between LLMs and the dynamic web, ensuring that agentic queries are anchored in relevant, real-time, and high-fidelity information. Octen AI's infrastructure delivers production-grade performance, featuring a 99ms average response time, minute-level index updates, and support for over 1 million queries per second (QPS).
This capital infusion will support Octen AI's strategic growth initiatives, including further product development and market expansion. The company has already established its Octen-Embedding Series, with Octen-8B setting new global benchmarks and becoming a recognized embedding model for precision and long-context understanding. The funding will also bolster the continued development of its proprietary, ultra-large-scale distributed search engine, engineered for maximum scalability, real-time freshness, and ultra-low latency.
The investment underscores the increasing demand for robust infrastructure capable of meeting the rigorous demands of AI-driven search scenarios. Octen AI plans to leverage this funding to continue redefining search standards and power the next generation of AI-driven applications and agentic workflows worldwide.










