Skip to main content
SHARE
Publication

Inverse Regression for Extraction of Tumor Site from Cancer Pathology Reports...

by Abhishek K Dubey, Hong Jun Yoon, Georgia Tourassi
Publication Type
Conference Paper
Journal Name
IEEE-EMBS International Conference on Biomedical and Health Informatics 2019
Book Title
2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)
Publication Date
Page Numbers
1 to 4
Conference Name
IEEE EMBS International Conference on Biomedical & Health Informatics (IEE-EMBS BHI 2019)
Conference Location
Chicago, Illinois, United States of America
Conference Sponsor
IEEE-EMBS
Conference Date
-

Pathology reports are the primary source of information for cancer diagnosis of millions of the cancer patients across the United States. Cancer registries label these reports every year. The coded labels incorporate pertinent information such as cancer location, behavior, and histology. This information when combined with clinical information, medical imaging and even genomic information have a great potential to fuel discoveries in cancer research. The coding process is manual and requires many human experts to label the large volume of pathology reports in a timely manner. In this study, we have developed a supervised inverse regression based auto-labeler to automate the task. The experiments were conducted on a set of 942 pathology reports with human expert labels as the ground truth. We observed that the inverse regression based auto-labeler consistently performed better than or comparable to the best performing state-of-the-art method. These results demonstrate the potential of inverse regression for reliable information extraction from the pathology reports.