An AI-powered localization platform that adapts Text, Voice, and Visual Content for different languages, cultures, and regions using Generative AI, OCR, and Speech Processing.
This project was developed as a Mini Project (AIML) 2025-2026 using Flask, Gemini API, EasyOCR, Speech Recognition, and AI-based localization techniques.
The Automated Content Localization Platform helps users localize different types of content:
- π Text Localization
- π€ Voice Localization
- πΌοΈ Visual Localization
The system provides:
- AI-generated multilingual localization
- Region-specific cultural adaptation
- OCR-based text extraction from images
- Speech-to-Text and Text-to-Speech conversion
- Tone-aware content generation
- Translate and localize text
- Supports multiple tones
- Region-aware adaptation
- AI-generated culturally relevant output
- Upload audio files
- Speech-to-Text conversion
- AI localization processing
- Text-to-Speech generated localized audio
- Upload image files
- OCR text extraction using EasyOCR
- Detect image context
- Generate localized captions and UI text
- Cultural notes generation
- Python
- Flask
- HTML
- CSS
- JavaScript
- Jinja2 Templates
- Google Gemini API
- EasyOCR
- Pillow (PIL)
- OpenCV
- SpeechRecognition
- gTTS
- pydub
Automated-Content-Localization/
β
βββ app.py
βββ requirements.txt
βββ README.md
βββ .env
βββ .gitignore
βββ image.jpeg
βββ activate.bat
βββ library_list.txt
β
βββ services/
β βββ gemini_service.py
β βββ voice_service.py
β βββ visual_service.py
β βββ ocr_service.py
β
βββ templates/
β βββ home.html
β βββ text.html
β βββ visual.html
β βββ voice.html
β βββ result.html
β βββ feedback.html
β
βββ static/
β βββ css/
β βββ uploads/
β βββ outputs/
β
βββ instance/
β
βββ Visual-Backend/git clone <repository-link>
cd MINI_PROJECT_2Y-MAINpython -m venv venv
venv\Scripts\activatepython3 -m venv venv
source venv/bin/activatepip install -r requirements.txtThis project uses the Google Gemini API for AI-based localization.
Create a .env file in the project root directory.
Example:
GOOGLE_API_KEY=your_gemini_api_key_here-
Generate your API Key
-
Copy and paste it inside the
.envfile
Start the Flask application:
python app.pyOpen browser:
http://127.0.0.1:5000/- Original Text
- Target Language
- Region / Culture
- Tone
- Localized Text
- Cultural Adaptation
- Audio File / Voice Recording
- Target Language
- Region
- Tone
- Transcribed Text
- Localized Voice Output
- Generated Audio File
- Image Upload
- Optional Caption / UI Text
- Target Language
- Tone
- Extracted OCR Text
- Localized Visual Caption
- Cultural Notes
- Formal
- Casual
- Marketing
- Narrative
- Professional
- Formal
- Casual
- Professional
- Friendly
User Input
β
Gemini AI Processing
β
Localization Generation
β
Cultural Adaptation
β
Final Output
Audio Upload
β
Speech-to-Text
β
AI Localization
β
Text-to-Speech
β
Localized Voice Output
Image Upload
β
OCR Text Extraction
β
Image Context Understanding
β
AI Localization
β
Localized Visual Output
- Educational content localization
- Voice translation systems
- Multilingual customer support
- Marketing localization
- Visual content adaptation
- Regional UI localization
---"# Automated_Content_Localization"
