Skip to main content

cate transform

Overview

Transform a single file by detecting and replacing sensitive entities (PII). Supports placeholder, masking, and lookup table strategies. Works with text, markdown, HTML, PDF, Word, JSON, YAML, and CSV files.

Usage

python -m src.anonymization.cli transform FILE [OPTIONS]

Options

OptionDescriptionDefault
FILEInput file to transform (positional, required)--
-o, --outputOutput file path<name>_anonymized.<ext>
-c, --configPath to CATE configuration file--
-s, --strategyStrategy: placeholder, mask, lookupplaceholder
-n, --dry-runShow what would be done without writingfalse
--encodingFile encodingutf-8

Prerequisites

Examples

Basic transformation (placeholder strategy)

python -m src.anonymization.cli transform document.txt

Masking strategy with config

python -m src.anonymization.cli transform data.json -s mask -c cate.yaml

Dry run to preview changes

python -m src.anonymization.cli transform data.json --dry-run

Explicit output path

python -m src.anonymization.cli transform input.pdf -o output.pdf