Skip to content

Latest commit

 

History

History
268 lines (183 loc) · 7.33 KB

File metadata and controls

268 lines (183 loc) · 7.33 KB

📖 User Guide — NullifyPDF

Your step-by-step guide to redacting sensitive data from PDFs safely and securely.

Tip

This takes about 5 minutes. New to PDF redaction? Start with Step 1 below.


📋 Quick Overview

🎯 Step ⏱️ Time What You'll Do
1️⃣ Load 30 sec Open a PDF file
2️⃣ Configure 30 sec Select document language
3️⃣ Auto-Redact 1-2 min Run AI to find sensitive data
4️⃣ Review 1-2 min Manually fix any mistakes
5️⃣ Blindfold 30 sec Hide images (optional)
6️⃣ Export 30 sec Save the redacted PDF

Step 1️⃣ — Load Your PDF

  1. Click the blue "Open PDF" button in the top left
  2. Select a PDF file from your computer
  3. The document appears in the center. Scroll with mouse or use arrow buttons (top right)

Note

Supported formats: PDF (unencrypted, uncompressed text)


Step 2️⃣ — Configure AI Language

Before scanning, select your document's language from the left sidebar:

🌐 Language Use When
EN Document is entirely in English
IT Document is entirely in Italian
BOTH Document contains mixed languages

Warning

Wrong language = missed detections. Choose the exact language of your document.


Step 3️⃣ — Automatic Redaction (AI Scan)

Click the "Auto Redact (AI)" button. NullifyPDF will automatically find and flag:

  • 🧑 Names & Surnames
  • 🏙️ Cities & Addresses
  • 📧 Email Addresses
  • 📱 Phone Numbers
  • 💳 IBANs & Credit Card Numbers
  • Cryptocurrency Addresses

What to expect:

  • Black boxes appear over detected data
  • Progress bar shows scanning progress
  • On large PDFs (500+ pages): 30-60 seconds normal
  • UI stays responsive (no freezing)

Tip

The AI isn't perfect. You'll review results in Step 4.


Step 4️⃣ — Manual Review & Corrections

No AI is 100% accurate. You can fix mistakes:

Add a Redaction

Draw a box over text you want to hide:

  1. Click and drag mouse over sensitive text
  2. A black box appears
  3. Redaction is scheduled (not destructive yet)

Remove a Redaction

Click once on a black box to remove it:

  1. Single click on the redaction box
  2. Box disappears
  3. Word is added to Allowlist (AI will ignore it next time)

Zoom In/Out

Make small text larger:

  • Press + / - keys, OR
  • Hold CTRL + Mouse wheel up/down

Important

Changes are preview only until you export. You can undo everything before export.


Step 5️⃣ — Hide Images & Photos (Optional)

Remove logos, signatures, or scanned photos:

  1. Enable the toggle: "Blindfold Mode" (sidebar)
  2. Click "Auto Redact (AI)" again
  3. All images replaced with gray placeholder: [ IMAGE REMOVED ]

Note

Use this for scanned documents where text is embedded in images (no OCR available).


Step 6️⃣ — Export the Secure PDF

When satisfied with redactions:

  1. Click "Export Secure PDF" button
  2. Choose filename and location
  3. The new PDF is now forensically sanitized:
    • Black boxes are binary-level destroyed (not recoverable)
    • Metadata removed
    • Hidden links destroyed
    • Interactive forms flattened

Caution

Export is destructive and permanent. Keep a backup of the original PDF.


🔧 Dictionary Management

Your redaction preferences are saved automatically:

Storage Location

OS Path
Windows C:\Users\<username>\.nullifypdf\
macOS ~/.nullifypdf/
Linux ~/.nullifypdf/

Files Inside

.nullifypdf/
├── blocklist.txt     ← Words to ALWAYS redact
├── allowlist.txt     ← Words to NEVER redact
├── logs/
│   └── nullifypdf.log

How to Edit Manually

📝 Edit Blocklist/Allowlist Directly
  1. Open file manager, navigate to ~/.nullifypdf/
  2. Edit blocklist.txt or allowlist.txt with any text editor
  3. One word per line, UTF-8 encoding
  4. Restart NullifyPDF to load changes

Example blocklist.txt:

john smith
acme corporation
admin@example.com

Mutual Exclusivity

⚠️ A word cannot be in BOTH lists simultaneously.

If you add a word to Allowlist that's already in Blocklist, it's removed from Blocklist automatically.


✅ Best Practices

  1. Backup Original — Always keep the original PDF before redacting
  2. Test on Copy — Redact a test copy first, verify results
  3. Verify Export — Open exported PDF to confirm redactions look correct
  4. Choose Language Carefully — Wrong language = missed detections
  5. Use Allowlist Sparingly — Only add words you're 100% sure aren't sensitive

📊 Logging & Diagnostics

View Logs

Log files track all activity. Location:

OS Path
Windows C:\Users\<username>\.nullifypdf\logs\nullifypdf.log
macOS/Linux ~/.nullifypdf/logs/nullifypdf.log

Log Format

2026-06-06 14:23:45 - INFO - Loaded PDF: document.pdf
2026-06-06 14:23:50 - INFO - AI scan complete: 5 detections
2026-06-06 14:24:10 - ERROR - Export failed

Enable Debug Mode

For detailed troubleshooting, enable verbose logging:

🔍 Windows (PowerShell)
$env:NULLIFYPDF_DEBUG = "true"
python NullifyPDF.py
🔍 macOS/Linux (Bash)
export NULLIFYPDF_DEBUG=true
python3.12 NullifyPDF.py

Effect: Debug logs include full stack traces. Useful when reporting bugs.

Log Rotation

  • Max file size: 5 MB
  • Backup copies: 3 old logs auto-deleted
  • Encoding: UTF-8

❓ Common Questions

Q: Why doesn't AI detect text in my scanned PDF?

A: NullifyPDF analyzes only digital text, not images. Use Blindfold Mode to hide entire image blocks.

Q: Can I password-protect the exported PDF?

A: Not built-in. Use a PDF editor after export for password protection.

Q: How do I know redactions are truly destroyed?

A: NullifyPDF performs binary-level destruction. See SECURITY.md for technical details.

Q: Can I undo changes after export?

A: No. Export is permanent and destructive. Always keep the original.

Q: Does NullifyPDF send data to the cloud?

A: No. 100% local processing. No network calls except GitHub release checks.


🚀 Next Steps


Last updated: 2026-06-06
Back to README | Troubleshooting →