Databricks Integration
Connect to Verilex data via Delta Sharing. Data appears as native Spark/SQL tables in your Databricks workspace.
Setup
- 1. Request a Delta Sharing profile from us (contains endpoint URL + auth token).
- 2. Add the sharing profile to your Databricks workspace.
- 3. Query shared tables:
# Python
df = spark.read.format("deltaSharing").load("verilex.npi.providers")
df.filter("state = 'CA'").show()
# SQL
SELECT * FROM delta_sharing.`verilex.sec.filings`
WHERE form_type = '10-K'
ORDER BY filing_date DESC
LIMIT 100; Available shares
| Share | Table | Description |
|---|---|---|
| verilex.npi | providers | 9M+ US healthcare providers |
| verilex.sec | filings | Structured SEC EDGAR filings |
| verilex.weather | observations | Daily weather observations from 200+ NOAA stations |
| verilex.pacer | cases | Federal court case metadata |
| verilex.otc | companies | OTC companies with shell risk scores & analytics |
| verilex.trademarks | registrations | US trademarks with owner, class, and status data |
| verilex.patents | grants | US patents with assignees, inventors, and CPC classifications |
| verilex.company | profiles | Cross-dataset company profiles with SEC, patent, and trademark data |
| crypto | API-only (PostgreSQL-backed, not available via Delta Sharing) | |
Note: Databricks pricing is 1.2x the base price. Data is served from our self-hosted Delta Sharing server — no Databricks storage costs.