Turning public datasets into actionable insights for everyone through natural language queries and AI-powered analytics.
GoFetch is an interactive web platform that democratizes access to public datasets by integrating AI, MongoDB's advanced search capabilities, and modern web technologies. Our mission is to transform raw data into actionable knowledge for everyone, from researchers to everyday citizens.
Live Demo: [Your deployed URL here]
Public datasets hold immense potential to inform decisions and drive change, but they are often underutilized due to:
- Complex data analysis tools requiring technical expertise
- Lack of intuitive interfaces for non-experts
- Limited leverage of AI and advanced search technologies
- Barriers for users without data science backgrounds
GoFetch bridges this gap by providing:
- Natural Language Querying: Ask questions in plain English—no coding required
- Real-Time Insights: Always see the latest data and trends
- AI Predictive Analytics: Get forecasts and recommendations to inform decisions
- Vector Search: Uncover hidden patterns using MongoDB's advanced capabilities
Our MVP focuses on India's air quality dataset from data.gov.in to address pressing public health concerns.
🗺️ Interactive Map Visualization
- Real-time air quality data across regions
- Color-coded AQI markers for instant assessment
- Historical trend analysis and time-based filtering
🔍 Natural Language Search
- Query data using conversational language
- Example: "Show me air quality in Delhi last year"
- Intelligent query interpretation and processing
🤖 AI-Powered Predictions
- Forecast future air quality levels
- Machine learning models for trend analysis
- Health impact assessments and recommendations
📱 Responsive Web Interface
- Mobile-first design approach
- Intuitive navigation for all user types
- Real-time data updates and notifications
- React.js with TypeScript for type-safe, component-based UI
- Leaflet.js for interactive mapping and geospatial visualization
- CSS3 with responsive design principles
- Node.js with Express.js for efficient API management
- MongoDB with Atlas Search for flexible data storage and vector search
- TensorFlow.js for lightweight AI capabilities
- Google Cloud Storage for scalable data hosting
- MongoDB Vector Search for semantic data discovery
- Real-time APIs for live environmental data
- TypeScript for enhanced code quality
- Environment-based configuration
- CORS and security middleware
- Comprehensive error handling
- Global Market: Data analytics market expected to reach $132.9 billion by 2026
- Indian Context: 5,000+ datasets available on India's Open Data Portal
- Target Users: Researchers, policymakers, citizens, and environmental advocates
- No-Code Data Exploration: Transform complex queries into simple conversations
- AI-Enhanced Insights: Predictive analytics without technical expertise
- Vector-Powered Discovery: Find hidden patterns and correlations
- Real-Time Processing: Always current, always relevant data
- Node.js (v14 or higher)
- npm or yarn package manager
- MongoDB Atlas account (for full NLP features)
-
Clone the repository
git clone https://github.com/yourusername/gofetch.git Set-Location gofetch
-
Install dependencies
# Install backend dependencies Set-Location backend npm install # Install frontend dependencies Set-Location ..\frontend npm install
-
Environment Configuration
# Backend environment variables Set-Location ..\backend Copy-Item .env.example .env # Edit .env file with your API keys and MongoDB connection string
-
Start the application
# Start backend server (from backend directory) npm run dev # In a new PowerShell window, start frontend (from frontend directory) Set-Location ..\frontend npm start
-
Access the application
- Frontend:
http://localhost:3000 - Backend API:
http://localhost:5000
- Frontend:
GET /api/air-quality/search?query={location}
GET /api/air-quality/stations
GET /api/predictions/{location}
POST /api/nlp/query
- Air quality data visualization
- Interactive mapping interface
- Basic search functionality
- Real-time data integration
- Natural language query processing
- MongoDB Atlas Search integration
- AI prediction models
- Vector search capabilities
- Multi-dataset expansion (healthcare, education, finance)
We welcome contributions! Please follow these steps:
- Fork the repository
- Create a feature branch:
git checkout -b feature/amazing-feature
- Commit your changes:
git commit -m 'Add amazing feature'
- Push to the branch:
git push origin feature/amazing-feature
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.