Skip to content
@Thordata

Thordata · AI Proxy & Web Data

AI Proxy & Web Data

⚡ Thordata

The AI-Native Web Data Infrastructure for Developers & Agents

🌐 Website📚 Documentation📊 Dashboard📧 Support

Uptime Proxies AI Ready License


🚀 Why Thordata?

Thordata is the next-generation web scraping and proxy infrastructure designed for the AI era. While traditional providers focus on manual scraping, we build pipelines that feed data directly into LLMs, RAG systems, and AI Agents.

We process over 100M+ requests daily with a focus on speed, success rates, and developer experience.

🌟 Core Value Proposition

  • AI-First Architecture: Native support for MCP (Model Context Protocol) and LangChain.
  • Unblockable Infrastructure: Proprietary Web Unlocker technology that handles Captchas, fingerprints, and JS rendering automatically.
  • Massive IP Network: Ethical access to 60M+ Residential, Mobile, and ISP IPs in 181+ locations.
  • Developer Centric: Modern SDKs, strictly typed responses, and "Copy-Paste" ready examples.

🛠️ The Ecosystem

1. 🧠 AI & LLM Integrations (Next-Gen)

Empower your AI agents to browse the web and fetch real-time context.

Repository Description Status
thordata-mcp-server 🤖 AI Bridge: Connect Claude Desktop / OpenAI directly to real-world web data via MCP protocol. ✅ Stable
thordata-rag-pipeline 🔍 RAG Ready: Clean, structured data extraction pipeline optimized for Vector Databases. 🚧 Beta
thordata-langchain-tools 🦜🔗 LangChain: Official tools to turn Thordata into a web-browsing tool for your agents. 🚧 TBD

2. ⚡ Official SDKs

Type-safe, robust, and production-ready libraries for your stack.

Language Repository Features
Python thordata-python-sdk The flagship SDK. Async support, full typing, deeply integrated with Pandas/AI stacks.
Node.js thordata-js-sdk TypeScript ready. Perfect for serverless and puppeteer/playwright integrations.
Go thordata-go-sdk High-concurrency client for enterprise-grade scraping systems.
Java thordata-java-sdk Enterprise compliant, thread-safe implementation.

3. 🕸️ Scraping Solutions

From raw HTML to structured JSON, we handle the complexity.

  • SERP API: Real-time search results from Google, Bing, Yandex (Search, Shopping, Maps, News).
  • Web Scraper API: "Swiss Army Knife" for any URL. Handles rendering, waiting, and extraction.
  • Scraping Browser: Headless browsers hosted on our cloud. Connect via CDP/Selenium/Puppeteer.

💻 Quick Start (Python)

Install our flagship SDK:

pip install thordata

Scenario: Search Google for "AI Agents" and get JSON results

import os
from thordata import ThorClient

# Initialize with your tokens
client = ThorClient(
    scraper_token=os.getenv("THORDATA_SCRAPER_TOKEN"),
    public_token=os.getenv("THORDATA_PUBLIC_TOKEN"),
    public_key=os.getenv("THORDATA_PUBLIC_KEY")
)

# 1. SERP Search (Google)
results = client.serp.search(
    engine="google",
    q="AI Agents using Web Data",
    location="United States",
    num=5
)

for item in results.get('organic_results', []):
    print(f"Title: {item['title']}")
    print(f"Link: {item['link']}")

# 2. Universal Scrape (Any URL)
html_content = client.universal.request(
    url="https://www.example.com",
    js_render=True,
    country="us"
)

🌍 Global Proxy Network

We provide the foundation for anonymous web access.

Type Repository / Docs Use Case
Residential Docs 60M+ IPs. Perfect for high-trust scraping (Social, E-commerce).
Datacenter Docs High speed, low cost. Best for market intelligence.
ISP Docs Static residential IPs. Keep the same session for banking/login flows.
Mobile Docs 3G/4G/5G IPs for mobile-only app verification.

🤝 Community & Support

We are building for the developers.

  • 🐛 Found a bug? Open an issue in the respective repository.
  • 💡 Feature Request? Check our Roadmap or discuss in Discussions.
  • 📧 Enterprise Inquiry? Contact business@thordata.com for custom plans (>1TB/month).

© 2024-2026 Thordata Inc. All rights reserved. Built with ❤️ for the data community.

Pinned Loading

  1. thordata-python-sdk thordata-python-sdk Public

    The official Python SDK for Thordata Proxy Infrastructure.

    Python 1

  2. thordata-mcp-server thordata-mcp-server Public

    Official MCP Server for Thordata: Connect AI Agents to Real-Time Web Data.

    Python 2

  3. thordata-rag-pipeline thordata-rag-pipeline Public

    🚀 Production-grade RAG pipeline powered by Thordata Scrapers. Turn any website, app reviews, or e-commerce data into clean, searchable AI knowledge.

    Python 1

Repositories

Showing 10 of 18 repositories

Sponsoring

  • @lexiforest
  • @s0md3v
  • @unclecode

Top languages

Loading…

Most used topics

Loading…