📰🚀 Advanced News Scraper with AI 🧠 — auto-collects headlines from BBC, CNN, Reuters 🌍. Features keyword filter 🔍, summaries 📝, topic tagging 🏷️, scheduling ⏰, email alerts 📧, GUI 💻, web dashboard 🌐, API 📡, and safe scraping 🔄. Export CSV/JSON/TXT 📂.
Welcome to the ultimate news scraping toolkit!
This project empowers you to collect, analyze, and interact with the latest headlines from top news sources—automatically and intelligently.
git clone https://github.com/yourusername/news_scraper.git
cd news_scraper
pip install -r requirements.txt
python -m spacy download en_core_web_sm
config.yaml to set up your news sources, email, push notifications, and scheduler.python main.py --site bbc --keyword "AI" --output ai_headlines.json --format json
python main.py --schedule
python -m gui.app
python -m web.dashboard
python -m api.server
python main.py --help
| Option | Description | Example |
|---|---|---|
| –site | News source (bbc, cnn, reuters, etc.) | –site cnn |
| –keyword | Filter by keyword | –keyword “election” |
| –output | Output filename | –output results.csv |
| –format | Output format: text, csv, json | –format csv |
| –max | Max number of headlines | –max 30 |
| –summarize | Summarize headlines (AI) | –summarize |
| –schedule | Run as scheduled task | –schedule |
python -m gui.apppython -m web.dashboardpython -m api.serverMIT License. Free for personal and commercial use.
Happy Scraping! ✨
We use cookies
We use cookies to analyze traffic and improve your experience. You can accept or reject analytics cookies.