StepForge-DataCollection

A guide for extracting titles, authors, and citations from Google Scholar using Python and Oxylabs SERP Scraper API.
Updated 2025-12-30 22:50:12 -08:00
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
Updated 2023-10-30 04:46:27 -07:00
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Updated 2021-06-28 12:56:23 -07:00