A collection of general-purpose scripts for file processing, document conversion, and audio transcription.
You can install this toolkit directly from GitHub using pip. This will automatically install all dependencies and register the terminal CLI commands:
pip install git+https://github.com/Khumee/utility-deck.gitNote: For the PDF conversion tools, you will also need to install the Playwright browser dependencies:
playwright install chromiumOnce installed, you can run these commands directly from any directory in your terminal:
Transcribes audio files (like .mp3, .ogg, .wav) to text using Google's Gemini API:
# Set your API Key first (e.g. in Windows PowerShell)
[Environment]::SetEnvironmentVariable("GEMINI_API_KEY", "your_api_key", "User")
# Run transcription
audio-transcribe --file "my_recording.ogg" --language "Urdu"Converts HTML pages or folders of HTML files into print-ready A4 PDFs:
html-to-pdf --input "document.html" --output "output.pdf"Converts Markdown .md documents directly to PDF format:
md-to-pdf "document.md" "output.pdf"Extracts plain text contents from a PDF file:
extract-pdf "document.pdf"