Drug Safety Data Source Selector
Select Your Scenario
Recommended Data Source
Key Comparison
- 87% lab value completeness
- Cost: $1.2M-$2.5M startup
- Best for rare diseases
- 52% lab value completeness
- Cost: Low (existing systems)
- Best for large populations
When a new drug hits the market, the real test of safety doesnāt happen in a controlled clinical trial. It happens in the messy, real world-where millions of people take it daily, with different diets, other medications, genetic backgrounds, and lifestyles. Thatās where real-world evidence comes in. Two of the most powerful tools for tracking drug safety after approval are patient registries and claims data. Together, they give regulators, doctors, and drugmakers a clearer picture of whatās really happening out there-beyond the 6-month study window of a typical trial.
What Exactly Is Real-World Evidence?
Real-world evidence (RWE) isnāt guesswork. Itās data pulled from everyday healthcare systems: doctor visits, hospital stays, pharmacy fills, lab results, and even patient self-reports. The U.S. Food and Drug Administration (FDA) officially defined it in 2018 as clinical evidence derived from real-world data (RWD)-information collected outside of randomized trials. Since then, RWE has gone from a niche tool to a core part of drug safety monitoring. Between 2017 and 2021, the FDA approved 12 new drug uses where RWE played a decisive role. Five of those approvals relied directly on claims data or registry records.Registries: The Deep Dive into Patient Stories
Think of registries as detailed medical diaries for specific groups of patients. Disease registries track people with conditions like cystic fibrosis, Parkinsonās, or rare cancers. Product registries follow patients using a specific drug or device. These arenāt random surveys-theyāre structured, long-term collections of standardized information. For example, the Cystic Fibrosis Foundation Patient Registry helped spot safety signals for ivacaftor, a drug that works only for certain genetic mutations. In clinical trials, those mutations were too rare to notice any unusual side effects. But in the registry, with over 30,000 patients tracked over years, patterns emerged. Thatās the power of depth. Registries capture things claims data often misses: lab values, imaging results, patient-reported symptoms, and detailed treatment histories. According to ISPOR, registry data has 87% completeness for lab results, compared to just 52% in claims data. Thatās why theyāre especially valuable for rare diseases or complex conditions where small changes matter. But registries arenāt perfect. Theyāre expensive to run-often costing $1.2 to $2.5 million to launch and $300,000 to $600,000 a year to maintain. Participation rates hover between 60% and 80%, which can introduce selection bias. And nearly one in three academic registries shut down within five years due to funding gaps. Still, when theyāre well-run, theyāre among the most trusted sources for regulatory decisions.Claims Data: The Big Picture from Billing Records
Claims data is what happens behind the scenes when a doctor bills an insurance company. Itās not clinical notes-itās codes: ICD-10 for diagnoses, CPT for procedures, NDC for medications. Itās dry, but itās massive. IBM MarketScan tracks 200 million lives. Optum covers 100 million. Medicare claims alone span 15+ years of data for millions of elderly patients. This scale makes claims data perfect for spotting rare side effects. If a drug causes a heart rhythm problem in 1 in 10,000 people, you need huge numbers to see it. In 2015, the FDA analyzed 1.2 million Medicare beneficiaries to check if entacapone increased cardiovascular risk. They found no link. In 2014, they used 850,000 records to study olmesartan in diabetics-again, finding no clear danger. Claims data is also incredibly complete for basic healthcare use. Inpatient care is tracked at 95-98% accuracy. Thatās why itās the go-to for tracking hospitalizations, ER visits, or medication adherence after a drug launches. But hereās the catch: claims data doesnāt tell you why a patient was hospitalized. Was it the drug? An infection? A fall? It lacks clinical context. Lab values are only 45-60% complete. Patient-reported symptoms? Almost never there. And diagnosis codes can be wrong-up to 20% error rates, according to AHRQ. Thatās why many signals from claims data turn out to be false alarms.
Registries vs. Claims Data: When to Use Which
| Feature | Registries | Claims Data | |--------|------------|-------------| | Data Depth | High: labs, imaging, symptoms | Low: mostly codes and billing info | | Population Size | 1,000-50,000 patients | 100 million+ patients | | Longitudinal Coverage | 5-15 years (if funded) | 15+ years (Medicare) | | Completeness of Lab Values | 87% | 52% | | Rare Event Detection | Needs ~500,000 records | Needs ~1 million records | | Cost to Build | $1.2M-$2.5M upfront | Low (uses existing systems) | | Best For | Rare diseases, complex outcomes, long-term safety | Large populations, hospitalizations, medication use | For oncology drugs, registries dominate-38% of RWE submissions use them. Thatās because cancer treatments are complex, and outcomes like progression-free survival or quality of life need detailed tracking. In cardiovascular drugs, claims data leads at 45% of submissions. Why? Because heart attacks, strokes, and hospital readmissions are well-coded in billing systems. The smartest approach? Use both. The International Council for Harmonisation (ICH) E2 proposal from June 2023 says combining the two reduces false positive safety signals by 40%. A claim shows a spike in liver enzyme reports. A registry confirms those patients had no other liver disease, were on the same dose, and had matching lab results. Thatās how you move from suspicion to certainty.Real-World Evidence in Action: Success Stories
In 2017, the FDA approved an expanded use of pembrolizumab (Keytruda) based on registry data from an expanded access program. Patients outside clinical trials were getting the drug, and their outcomes were recorded in real time. That data helped confirm it worked in more patient groups than originally tested. In 2021, the European Medicines Agency approved a new use for tacrolimus (a transplant drug) using data from the Scientific Registry of Transplant Patients. That registry tracked outcomes for over 100,000 transplant recipients-something no trial could ever replicate. And in 2019, palbociclibās approval for a new patient group relied on claims data, electronic health records, and safety reports. The FDA needed to know if it was safe for older patients with multiple health problems. Claims data showed no increased risk of blood clots or hospitalizations in that group-enabling a broader label.