This document outlines privacy considerations, data handling practices, and recommendations for using renAI securely.
renAI processes documents locally and sends text content to Large Language Model (LLM) providers for metadata extraction. Understanding how data flows through the system is essential for making informed decisions about your document privacy.
Local Document β Text Extraction β LLM API β Metadata Response
β
Local Cache Storage
When using cloud LLM providers (DeepInfra, OpenRouter, OpenAI), the following data is transmitted:
| Data Type | Description | Sent to LLM |
|---|---|---|
| Document Text | Extracted text content (up to 10000 chars) | β Yes |
| Sampled Images | Base64 encoded page samples (Vision Mode only) | β Yes |
| Original Images | Full image files when using --rename-images |
β Yes |
| File Metadata | Filename, file size, page count | β No |
| File Content | Full document binary/source | β No |
| Category List | Your custom categories from categories.toml |
β Yes (in prompt) |
RENAI_TEXT_LENGTH, default 10000 characters)| Provider | Data Location | Privacy Level |
|---|---|---|
| DeepInfra | US-based servers | Standard |
| OpenRouter | Various providers | Standard |
| OpenAI | US-based servers | Standard |
Characteristics:
| Tool | Setup Required | Model Options |
|---|---|---|
| Ollama | Install Ollama, run ollama serve |
Llama 3, Mistral, Gemma, etc. |
| LM Studio | Install LM Studio, enable API | Any GGUF model |
| LocalAI | Docker or binary installation | Many options |
| vLLM | Server installation | High-performance |
Characteristics:
renAI utilizes modern OS-agnostic path management via the platformdirs library. Cache files are stored in your operating systemβs standard cache location:
%LOCALAPPDATA%\renAI\Cache~/Library/Caches/renAI~/.cache/renAIThe directory structure is organized as follows:
Cache/
βββ text/ # Extracted text files
βββ metadata/ # metadata files
# Clear all cache (text, metadata, and ocr_debug)
renai cache
renai cache to safely clear cached data.Risk Level: π’ Low
Recommendation: Any provider is suitable
Considerations:
# Example: Using OpenRouter for public books
renai process "C:/Public/Books" --mode rename --provider openrouter
Risk Level: π‘ Medium
Recommendation: Use local LLM or trusted cloud provider
Considerations:
# Example: Using local Ollama for personal documents
# Set these in your OS-specific .env file or environment
RENAI_PROVIDER=custom
CUSTOM_API_BASE_URL=http://localhost:11434/v1
CUSTOM_MODEL=llama3
uv run renai process "C:/Personal/Documents" --mode rename
Risk Level: π΄ High
Recommendation: Use local LLM only
Considerations:
# Example: Using LM Studio for business documents
$env:CUSTOM_API_BASE_URL = "http://localhost:1234/v1"
$env:CUSTOM_MODEL = "your-model"
renai process "C:/Work/Documents" --mode rename --provider custom
Risk Level: π΄π΄ Very High
Recommendation: Local LLM with additional safeguards
Considerations:
export CUSTOM_API_BASE_URL="http://localhost:11434/v1"
export CUSTOM_MODEL="llama3"
renai process "C:/Sensitive/Files" --mode rename --provider custom
# Immediately clear cache after processing
renai cache
Before using any cloud provider, review their:
For local inference servers:
# Bind to localhost only (prevent external access)
# Ollama: OLLAMA_HOST=127.0.0.1:11434 ollama serve
# Configure firewall to block external access
# Instead of command-line arguments (which may be logged)
# Use the .env file in your OS configuration directory
RENAI_PROVIDER=deepinfra
DEEPINFRA_API_KEY=your-api-key-here
uv run renai process "C:/Books" --mode rename
Enable logging and review regularly:
# Check evaluation.log for processing history
cat evaluation.log
# Review console output for errors
| Provider | Data Retention | Training Data | Privacy Rating |
|---|---|---|---|
| Local (Ollama/LM Studio) | You control | N/A | βββββ |
| DeepInfra | 30 days | Check policy | βββ |
| OpenRouter | Varies by provider | Check policy | βββ |
| OpenAI | 30 days | May use | βββ |
After processing sensitive documents:
renai cache--clear-cache to remove cached dataA: Yes, by using a local LLM provider (Ollama, LM Studio). The tool requires an LLM API to function, but it can be entirely local.
A: Only the extracted text (up to 8000 characters) is sent to the LLM provider. The original document files are never uploaded.
A: This varies by provider. Check their terms of service:
A: For cloud providers, you can request data deletion per their policies. For local processing, you have complete control.
A: No, the cache is stored as plain text files. Use renai cache to manage your data or store the OS-specific cache directory on encrypted storage if processing sensitive documents.
A: If the initial LLM request fails, the system may retry with a modified prompt. Each retry sends the text content again.
| Date | Change |
|---|---|
| 2025-01-26 | Initial privacy guide |
| 2026-03-14 | Updated to include platformdirs and new CLI commands |
| 2026-04-09 | Updated for Vision Mode data sharing transparency, uv integration, and .env security |