tarzi 0.0.21

Rust-native lite search for AI applications
Documentation
<div align="center">
  <img src="https://github.com/mirasurf/tarzi/blob/4e751f8d389c0ac7f2061afa9286d2d7fa551aaf/static/tarzi-320.png" alt="Tarzi Logo" width="200" height="200">
</div>
<h1 align="center">tarzi</h1>  
<p align="center">
  <a href="https://crates.io/crates/tarzi">
    <img src="https://img.shields.io/crates/v/tarzi.svg?style=flat-square" alt="Crate Version" />
  </a>
  <a href="https://pypi.org/project/tarzi/">
    <img src="https://img.shields.io/pypi/v/tarzi.svg?style=flat-square" alt="PyPI Version" />
  </a>
  <!-- CI and Docs -->
  <a href="https://github.com/mirasurf/tarzi/actions/workflows/rust-ci.yml">
    <img src="https://github.com/mirasurf/tarzi/actions/workflows/rust-ci.yml/badge.svg" alt="Rust CI" />
  </a>
  <a href="https://github.com/mirasurf/tarzi/actions/workflows/python-ci.yml">
    <img src="https://github.com/mirasurf/tarzi/actions/workflows/python-ci.yml/badge.svg" alt="Python CI" />
  </a>
  <a href="https://tarzirs.readthedocs.io/en/latest/">
    <img src="https://app.readthedocs.org/projects/tarzirs/badge/?version=latest&style=flat" alt="Docs" />
  </a>
  <!-- License -->
  <a href="https://www.apache.org/licenses/LICENSE-2.0">
    <img src="https://img.shields.io/badge/License-Apache%202.0-blue.svg?style=flat-square" alt="License" />
  </a>
  <!-- X (formerly Twitter) -->
  <a href="https://x.com/mirasurf_ai">
    <img src="https://img.shields.io/twitter/follow/mirasurf_ai?label=@mirasurf_ai&style=flat-square" alt="X Follow" />
  </a>
</p>

## ๐Ÿ’ Tarzi

**Tarzi** is a unified search interface designed for **Retrieval-Augmented Generation (RAG)** and **agentic systems** built on large language models. Search is a core functionality in these systems, yet most search engine providers (SEPs) impose API paywalls or strict rate limits. **Tarzi**, empowered by browser automation and web crawling technologies, removes these barriers by supporting token-free queries across multiple search engines. With a single dependency, you can integrate and switch between different SEPs as neededโ€”seamlessly and efficiently.

<div align="center">
  <img src="static/tariz-workflow.png" alt="Tarzi Logo" width="100%">
</div>

## โš™๏ธ Core Capabilities

- ๐Ÿฆ€ **Dual Implementation**: Native Rust library and Python wrapper with CLI tools
- ๐Ÿ”„ **Content Conversion**: Convert raw HTML into Markdown, JSON, or YAML, which is ready for LLMs
- ๐Ÿ” **Search Integration**: Fetch fully rendered result pages with a unified interface for token-free headless browser mode
- ๐Ÿง  **Multi-Engine Support**: Works with Bing, Google, DuckDuckGo, Baidu etc.
- ๐Ÿ›ก๏ธ **Proxy Support**: Bypass network bans using proxy support to access global SEPs
- ๐Ÿš€ **End-to-End Workflow**: Full pipeline from search to content extraction for AI and automation use cases

## ๐Ÿงช Advanced Features (Under dev.)

- ๐Ÿ–ฅ๏ธ **Custom Browser Controls**: Set screen size, viewport, and locale for realistic behavior  
- ๐Ÿ•ต๏ธโ€โ™‚๏ธ **Anti-Bot Evasion**: Use fingerprint spoofing, proxy rotation, and human-like actions to avoid detection  
- ๐Ÿง  **Smarter Queries**: Improve search results with prompt rewriting and intent-aware queries 
- ๐Ÿ”— **Workflow Automation**: Chain steps like search, click, form fill, and scraping into automated flows  
- ๐Ÿค– **Agent Integration (MCP)**: Connect with agent frameworks for context-aware, distributed task execution  
- ๐Ÿ“Š **Observability**: Monitor success rate, latency, CAPTCHA frequency, and export logs for analysis

## Install

```
pip install tarzi
```

## CLI Commands

Tarzi provides two command-line interfaces:

- **`tarzi`**: Native Rust CLI (faster, more efficient)
- **`pytarzi`**: Python CLI (easier to extend, same functionality)

Both CLIs support the same commands and configuration precedence.

## Usage Examples

* Examples in Python and Rust: [examples](/examples/)

## Alternatives

* LangChain [PlayWrightBrowserToolkit](https://python.langchain.com/docs/integrations/tools/playwright/)

## Contributors

Thank you โค all human and non-human contributors.

[![tarzi contributors](https://contrib.rocks/image?repo=mirasurf/tarzi "tarzi contributors")](https://github.com/mirasurf/tarzi/graphs/contributors)