DATA SETS & ANALYTICS
Explore real-world data from TriNetX in your own research environment.
Go beyond cohort counts with fine-grained, date-stamped observations on more than 110M patients around the world.
Choose downloadable, curated, or linked data sets to power your analysis, plus non-US, regulatory-grade oncology registries to support market access and regulatory submissions.
Downloadable Data Sets
Explore the depths of patient data and discover insightful solutions with our licensed data sets. From diagnoses to genomics, each observation is linked to a pseudonymized ID, encounter, and date for robust, longitudinal insights. Our universal CSV format allows flexible analysis in your preferred application.
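As a rough illustration of what "linked to a pseudonymized ID, encounter, and date" makes possible, the sketch below groups CSV rows into a per-patient timeline. The column names (`patient_id`, `encounter_id`, `date`, `code`) and the sample rows are hypothetical and are not the actual TriNetX schema; substitute the headers from your own files.

```python
import csv
import io
from collections import defaultdict
from datetime import date

# Hypothetical sample data; column names are assumptions, not the TriNetX schema.
SAMPLE_CSV = """patient_id,encounter_id,date,code
P001,E10,2021-03-04,E11.9
P002,E20,2020-07-15,I10
P001,E11,2019-11-20,I10
"""

def build_timelines(csv_text):
    """Group observations by pseudonymized patient ID, sorted by date."""
    timelines = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Parse the ISO date string so rows sort chronologically.
        row["date"] = date.fromisoformat(row["date"])
        timelines[row["patient_id"]].append(row)
    for rows in timelines.values():
        rows.sort(key=lambda r: r["date"])
    return dict(timelines)

timelines = build_timelines(SAMPLE_CSV)
```

For a downloaded file, the same function could be fed the file's text instead of the inline sample.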
LUCID – Trusted Research Environment
LUCID empowers analysts to stage patient-level data seamlessly in a robust notebook environment, supporting your code without leaving TriNetX or downloading files. Explore, analyze, and model with complete control—discover a new era in research with LUCID.
TherapyMonitor Reports: Non-US Oncology Data
Tap into real-world analyses derived from longitudinal, physician-supplied data on oncology care in Europe. Unlock the power of TherapyMonitor, our comprehensive report providing in-depth analyses of current disease prevalence, treatment patterns, and market dynamics based on indication-specific patient-level data.
Your questions deserve more than just answers—they deserve solutions, and we’re here to help you discover them.
TriNetX Data Sets
Take possession of your data set in the format and environment you choose. Download CSV files from our platform or ask us to deliver the data tables via AWS Data Exchange, Amazon S3, Snowflake, Databricks, or any other web-based data service.
Licensed Data Sets
Our largest single resource for data sets is built from EHR data enriched with labs and mortality data. Patients are from all nine U.S. census divisions and three countries outside the U.S.
CURATED DATA SETS
Curated data sets are designed to be immediately usable. They’re built on a normalized, standards-based terminology and are available in an OMOP-inspired format, the native TriNetX format, or the Sentinel Common Data Model, ensuring compatibility and ease of use. These research-ready data sets include calculated, derived variables such as BMI, along with new therapeutic-specific tables and facts purpose-built for your analysis.
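To make "calculated, derived variables" concrete, here is a minimal sketch of how a BMI value could be derived from raw weight and height observations. The function name, units, and rounding are assumptions for illustration; this is not a description of how TriNetX computes its derived variables.

```python
def bmi(weight_kg, height_cm):
    """Return body mass index in kg/m^2, or None if an input is missing or invalid."""
    if weight_kg is None or height_cm is None:
        return None
    if weight_kg <= 0 or height_cm <= 0:
        return None
    height_m = height_cm / 100  # convert centimeters to meters
    return round(weight_kg / height_m ** 2, 1)
```

A curated data set that ships the derived value spares each analyst from repeating this step and from making their own choices about units and missing inputs.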
LINKED
De-identified U.S. patient data, including longitudinal histories from EHR and payer-sourced medical and pharmaceutical claims (“closed claims”), along with mortality data. Each patient in this data set has at least one pair of insurance coverage start and stop dates, enabling comprehensive care record analysis.
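Coverage start and stop dates are typically used to restrict an analysis to periods when a patient's claims are expected to be complete. The sketch below shows one way to keep only observations that fall inside a coverage window; the data layout (a list of start/stop date pairs per patient) and all names are hypothetical, not the delivered table structure.

```python
from datetime import date

def in_coverage(obs_date, coverage_periods):
    """Return True if obs_date falls within any (start, stop) pair, inclusive."""
    return any(start <= obs_date <= stop for start, stop in coverage_periods)

# Hypothetical coverage periods and observation dates for one patient.
coverage = [
    (date(2019, 1, 1), date(2019, 12, 31)),
    (date(2021, 1, 1), date(2021, 6, 30)),
]
observations = [date(2019, 5, 2), date(2020, 3, 10), date(2021, 2, 14)]

# Keep only observations recorded while the patient had coverage.
covered = [d for d in observations if in_coverage(d, coverage)]
```

Here the 2020 observation is dropped because it falls in the gap between the two coverage periods.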
MULTIPLE MYELOMA
Unlike patient registries, this data set reflects observations and treatments that precede the cancer diagnosis, as well as the full spectrum of concurrent non-hematological care, for a holistic view of each de-identified patient.
TRINETX + AWS
AWS Data Exchange makes it easy to find, subscribe to, and use third-party data in the cloud. With AWS Data Exchange, customers can integrate TriNetX data and other third-party data sources into any of their company’s software via API requests without ETL. They can simply use their AWS credentials and AWS SDK to call APIs from dozens of data providers. All data is encrypted at rest and in transit, and the service is integrated with AWS Identity and Access Management (IAM) to create fine-grained controls.
DATA SET CURATION SERVICES
Navigating through the complexities of data acquisition for your research shouldn’t be a challenge. TriNetX is here to simplify the process. Let our data and platform experts handle the intricate task of curation. We’ll work closely with you to understand your specific needs and craft and refine queries across our global data network, ensuring you receive a tailored data set without the hassle.
LUCID
Access and analyze rich, secure, real-time data:
- The healthcare industry’s largest global health research network
- Data captured on-site behind a secure HCO firewall
- Cloud-based platform for instantaneous access to query and mine longitudinal clinical data
- A state-of-the-art analytics suite that generates evidence to answer complex research questions
- A compliant pathway back to the patient (HIPAA, GDPR)
TherapyMonitor Reports
- Multiple Myeloma (MM)
- Diffuse Large B-Cell Lymphoma (DLBCL)
- Myelodysplastic Syndromes (MDS)
- Chronic Lymphocytic Leukemia (CLL)
- Myelofibrosis (MF)
Discover More TriNetX Solutions
Clinical Trial Design & Optimization
Insights & Real-World Evidence Generation
Leverage fit-for-purpose data analysis and methodology.
Pharmacovigilance & Drug Safety
Detect, analyze, and manage signals with world-class pharmacoepi expertise.