chatpack 0.2.2

Compress chat exports from Telegram, WhatsApp, and Instagram into token-efficient CSV for LLMs
Documentation

πŸ“¦ chatpack

Feed your chat history to LLMs. Compress exports 13x with CSV format.

CI codecov Crates.io Downloads License: MIT

Platforms: Windows β€’ macOS β€’ Linux

The Problem

You want to ask Claude/ChatGPT about your conversations, but:

  • Raw exports are 80% metadata noise
  • JSON structure wastes tokens on brackets and keys
  • Context windows are expensive

The Solution

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ Telegram JSON   β”‚     β”‚          β”‚     β”‚ Clean CSV       β”‚
β”‚ WhatsApp TXT    β”‚ ──▢ β”‚ chatpack β”‚ ──▢│ Ready for LLM   β”‚
β”‚ Instagram JSON  β”‚     β”‚          β”‚     β”‚ 13x less tokens β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Real Numbers

Format Input (Telegram JSON) Output Savings
CSV 11.2M tokens 850K tokens 92% (13x) πŸ”₯
JSONL 11.2M tokens 1.0M tokens 91% (11x)
JSON 11.2M tokens 1.3M tokens 88% (8x)

πŸ’‘ Use CSV for maximum token savings. JSONL is good for RAG pipelines. JSON keeps full structure but wastes tokens.

Use Cases

πŸ’¬ Chat with your chat history

chatpack tg telegram_export.json -o context.txt

# Paste into ChatGPT: "Based on this conversation, what did we decide about...?"

πŸ” Build RAG pipeline

chatpack tg chat.json -f jsonl -t -o dataset.jsonl

# Each line = one document with timestamp for vector DB

πŸ“Š Analyze conversations

chatpack wa chat.txt --from "Alice" --after 2024-01-01 -f json

# Filter and export specific messages

Features

  • πŸš€ Fast β€” 20K+ messages/sec
  • πŸ“± Multi-platform β€” Telegram, WhatsApp, Instagram
  • πŸ”€ Smart merge β€” Consecutive messages from same sender β†’ one entry
  • 🎯 Filters β€” By date, by sender
  • πŸ“„ Formats β€” CSV (13x compression), JSON, JSONL (for RAG)

Installation

Pre-built binaries (recommended)

Platform Download
Windows chatpack-windows-x64.exe
macOS (Intel) chatpack-macos-x64
macOS (Apple Silicon) chatpack-macos-arm64
Linux chatpack-linux-x64

Via Cargo

cargo install chatpack

Quick Start

# Telegram

chatpack tg result.json


# WhatsApp  

chatpack wa chat.txt


# Instagram

chatpack ig message_1.json

Output: optimized_chat.csv β€” ready to paste into ChatGPT/Claude.

Documentation

Guide Description
πŸ“€ Export Guide How to export from Telegram, WhatsApp, Instagram
πŸ“– Usage Guide All commands, flags, filters, formats
πŸ“Š Benchmarks Performance stats and compression metrics
πŸ§ͺ Stress Testing Generate toxic data and run stress tests

Quick Reference

# Output formats

chatpack tg chat.json -f csv      # 13x compression (default)

chatpack tg chat.json -f json     # Structured array

chatpack tg chat.json -f jsonl    # One JSON per line


# Filters  

chatpack tg chat.json --after 2024-01-01

chatpack tg chat.json --from "Alice"


# Metadata

chatpack tg chat.json -t          # Add timestamps

chatpack tg chat.json -t -r -e    # All metadata

Technical Details

Source Format Features
Telegram JSON IDs, timestamps, replies, edits
WhatsApp TXT Auto-detect locale (US/EU/RU), multiline
Instagram JSON Mojibake fix, empty message filter

Performance

Metric Value
Speed 20-50K messages/sec
CSV compression 13x (92% token reduction)
Tested file size 500MB+

License

MIT Β© Mukhammedali Berektassuly