Advancing Practice, Instruction, and Innovation Through Informatics (APIII 2003): Scientific Session and E-Poster Abstracts
Archives of Pathology & Laboratory Medicine, Oct 2004 by Becich, Michael J, Crowley, Rebecca
Design: Pathology reports from 3 institutions were examined for commonly encountered identifiers. Regular expressions were created to find and remove these identifiers. The scrubber was run iteratively on a training set until it exhibited good scrubbing performance. One thousand eight hundred new pathology reports (600 from each institution, encompassing 3 different time frames) were then processed, and each report was reviewed manually to look for identifiers that were missed (underscrubbing). The listing of removed text was also examined to find nonidentifying text that was removed (overscrubbing).
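The regular-expression approach described in the design can be illustrated with a minimal sketch. The patterns, labels, and the `scrub` function below are hypothetical and are not taken from the authors' software; a real scrubber would also rely on site-specific patterns and name lists. The sketch returns the list of removed text alongside the scrubbed report, since the design reviews that listing for overscrubbing.

```python
import re

# Hypothetical identifier patterns, for illustration only. These are NOT the
# authors' expressions; real patterns would be tuned per institution.
PATTERNS = [
    ("DATE", re.compile(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b")),
    ("ACCESSION", re.compile(r"\b[A-Z]{1,2}\d{2}-\d{3,6}\b")),
    ("NAME", re.compile(r"\b(?:Dr|Mr|Mrs|Ms)\.?\s+[A-Z][a-z]+\b")),
]


def scrub(text):
    """Replace matched identifiers with a label and record what was removed."""
    removed = []
    for label, pattern in PATTERNS:
        def replace(match, label=label):
            # Keep the removed text so a reviewer can check for overscrubbing.
            removed.append((label, match.group(0)))
            return f"[{label}]"
        text = pattern.sub(replace, text)
    return text, removed


report = "Case S03-12345 received 10/14/2003 from Dr. Smith."
scrubbed, removed = scrub(report)
print(scrubbed)
print(removed)
```

Reviewing `removed` corresponds to the overscrubbing check, while reading the scrubbed report for leftover identifiers corresponds to the underscrubbing check.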
Results: Approximately 33% of the pathology cases contained identifiers in the body of the report. Ninety-six percent of identifiers present in the test set were removed. The identifiers that were missed were largely institution names and foreign addresses. Of the scrubbed cases, 1.3% contained HIPAA-specified identifiers (names, accession numbers, and dates) that were missed. Outside consultation case reports typically contained numerous identifiers and were the most challenging to de-identify comprehensively. There was variation in performance among the test sets, highlighting the need for site-specific customization. Overscrubbing was more prevalent than underscrubbing, and most instances of overscrubbing were due to the extensive list of personal and location names used.
Conclusions: Our first test of this software supports the initial hypothesis that robust de-identification software can be built with open-source tools. The application is currently capable of removing the vast majority of identifying information from pathology reports while leaving the nonidentifying text intact. Although the software does not yet perform perfectly, we expect that fine-tuning of the regular expressions and expansion of the database will remove the remaining identifiers. The major sources of underscrubbing are misspellings, accession numbers with unusual formats, and unexpected or unusual proper names.
Predicting Tumor Marker Outcomes With Monte Carlo Simulations
http://65.222.228.150/ijb/ramiab.htm
Jules J. Berman, MD, PhD (bermanj@mail.nih.gov). National Cancer Institute, National Institutes of Health, Bethesda, Md.
Context: Genome and proteome research have promised a revolution in tumor diagnosis. The revolution has not arrived. In fact, only a handful of new markers have appeared in the past several years. A simple thought experiment demonstrates the problem.
In a retrospective study, Dr X demonstrated a "perfect" tumor marker that never failed to distinguish between 2 tumor variants (aggressive and indolent) with identical morphology. In this example, the aggressive variant grows 10 times as fast and metastasizes at 10 times the rate of the indolent variant. In a prospective trial of the same marker, 200 tumors are excised at the time of clinical detection (tumor size, 2 cm). Dr X finds that 100 of the tumors stain as "indolent variants" and 100 stain as "aggressive variants." The trial monitors all 200 patients and determines survival at 5 years. At the end of the trial, there is no survival difference between patients with indolent variants and patients with aggressive variants. The marker is considered a total failure, with millions of dollars wasted on the prospective trial.
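One possible reading of the thought experiment can be explored with a small Monte Carlo sketch. It is not the author's model: it assumes that metastasis occurs at a constant rate while a tumor grows to the 2-cm detection size, and the growth times and rates below are invented for illustration. Under those assumptions, a variant that grows 10 times as fast spends one tenth of the time reaching detection, which offsets its 10-fold metastatic rate.

```python
import random


def simulate(n_patients, growth_years, met_rate_per_year, rng):
    """Fraction of tumors that metastasize before reaching detection size.

    Assumes (hypothetically) a constant metastasis rate during growth.
    """
    hits = 0
    for _ in range(n_patients):
        # Waiting time to the first metastasis under the constant-rate assumption.
        first_met = rng.expovariate(met_rate_per_year)
        if first_met < growth_years:
            hits += 1
    return hits / n_patients


rng = random.Random(0)  # fixed seed so the run is repeatable

# Illustrative numbers only: the aggressive variant reaches 2 cm in one tenth
# of the time and metastasizes at 10 times the rate.
indolent = simulate(100_000, growth_years=10.0, met_rate_per_year=0.05, rng=rng)
aggressive = simulate(100_000, growth_years=1.0, met_rate_per_year=0.5, rng=rng)

print(f"indolent:   {indolent:.3f}")
print(f"aggressive: {aggressive:.3f}")
```

With these inputs, both fractions should fall close to 1 - exp(-0.5), about 0.39, because rate multiplied by growth time is the same for both variants. In this simplified model, the two groups carry a similar metastatic burden at the time of excision, which is consistent with the equal survival described in the scenario.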


