data engineering · 2024

County Clerk Research Pipeline

Playwright scraping, PDF ingestion, and TRC data cleaning for Texas mineral rights research

The source layer for Ranger Discovery. Built to automate the document collection side of mineral rights research, scraping county clerk portals, ingesting and parsing PDFs, and processing Texas Railroad Commission records into a usable dataset.

Have a workflow like this?

If something here looks like a problem you're dealing with, let's talk through it, no pressure, no agenda.

Book a free call

County Clerk Research Pipeline

County Clerk Scraping

PDF Ingestion & Parsing

Texas Railroad Commission Data

Skills

Have a workflow like this?