Databricks Integration

Connect to Verilex data via Delta Sharing. Data appears as native Spark/SQL tables in your Databricks workspace.

Setup

  1. 1. Request a Delta Sharing profile from us (contains endpoint URL + auth token).
  2. 2. Add the sharing profile to your Databricks workspace.
  3. 3. Query shared tables:
# Python
df = spark.read.format("deltaSharing").load("verilex.npi.providers")
df.filter("state = 'CA'").show()

# SQL
SELECT * FROM delta_sharing.`verilex.sec.filings`
WHERE form_type = '10-K'
ORDER BY filing_date DESC
LIMIT 100;

Available shares

Share Table Description
verilex.npi providers 9M+ US healthcare providers
verilex.sec filings Structured SEC EDGAR filings
verilex.weather observations Daily weather observations from 200+ NOAA stations
verilex.pacer cases Federal court case metadata
verilex.otc companies OTC companies with shell risk scores & analytics
verilex.trademarks registrations US trademarks with owner, class, and status data
verilex.patents grants US patents with assignees, inventors, and CPC classifications
verilex.company profiles Cross-dataset company profiles with SEC, patent, and trademark data
crypto API-only (PostgreSQL-backed, not available via Delta Sharing)

Note: Databricks pricing is 1.2x the base price. Data is served from our self-hosted Delta Sharing server — no Databricks storage costs.

How can we help?

Choose a category and tell us what you need.