Implicit entity recognition in clinical documents

Sujan Perera; Pablo Mendes; Amit Sheth; Krishnaprasad Thirunarayan; Adarsh Alex; Christopher Heid; Greg Mott

doi:10.18653/v1/s15-1028

Implicit entity recognition in clinical documents

Sujan Perera, Pablo Mendes, Amit Sheth, Krishnaprasad Thirunarayan, Adarsh Alex, Christopher Heid, Greg Mott

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

9 Scopus citations

Abstract

With the increasing automation of health care information processing, it has become crucial to extract meaningful information from textual notes in electronic medical records. One of the key challenges is to extract and normalize entity mentions. State-of-the-art approaches have focused on the recognition of entities that are explicitly mentioned in a sentence. However, clinical documents often contain phrases that indicate the entities but do not contain their names. We term those implicit entity mentions and introduce the problem of implicit entity recognition (IER) in clinical documents. We propose a solution to IER that leverages entity definitions from a knowledge base to create entity models, projects sentences to the entity models and identifies implicit entity mentions by evaluating semantic similarity between sentences and entity models. The evaluation with 857 sentences selected for 8 different entities shows that our algorithm outperforms the most closely related unsupervised solution. The similarity value calculated by our algorithm proved to be an effective feature in a supervised learning setting, helping it to improve over the baselines, and achieving F1 scores of .81 and .73 for different classes of implicit mentions. Our gold standard annotations are made available to encourage further research in the area of IER.

Original language	English (US)
Title of host publication	Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015
Publisher	Association for Computational Linguistics (ACL)
Pages	228-238
Number of pages	11
ISBN (Electronic)	9781941643396
DOIs	https://doi.org/10.18653/v1/s15-1028
State	Published - 2015
Externally published	Yes
Event	4th Joint Conference on Lexical and Computational Semantics, *SEM 2015 - Denver, United States Duration: Jun 4 2015 → Jun 5 2015

Publication series

Name	Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015

Conference

Conference	4th Joint Conference on Lexical and Computational Semantics, *SEM 2015
Country/Territory	United States
City	Denver
Period	6/4/15 → 6/5/15

ASJC Scopus subject areas

Computer Networks and Communications
Computer Science Applications
Information Systems

Access to Document

10.18653/v1/s15-1028

Cite this

Perera, S., Mendes, P., Sheth, A., Thirunarayan, K., Alex, A., Heid, C., & Mott, G. (2015). Implicit entity recognition in clinical documents. In Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015 (pp. 228-238). (Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s15-1028

Implicit entity recognition in clinical documents. / Perera, Sujan; Mendes, Pablo; Sheth, Amit et al.
Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015. Association for Computational Linguistics (ACL), 2015. p. 228-238 (Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Perera, S, Mendes, P, Sheth, A, Thirunarayan, K, Alex, A, Heid, C & Mott, G 2015, Implicit entity recognition in clinical documents. in Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015. Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015, Association for Computational Linguistics (ACL), pp. 228-238, 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015, Denver, United States, 6/4/15. https://doi.org/10.18653/v1/s15-1028

Perera S, Mendes P, Sheth A, Thirunarayan K, Alex A, Heid C et al. Implicit entity recognition in clinical documents. In Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015. Association for Computational Linguistics (ACL). 2015. p. 228-238. (Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015). doi: 10.18653/v1/s15-1028

@inproceedings{022370a344664e96b11a2cc7d50ca981,

title = "Implicit entity recognition in clinical documents",

abstract = "With the increasing automation of health care information processing, it has become crucial to extract meaningful information from textual notes in electronic medical records. One of the key challenges is to extract and normalize entity mentions. State-of-the-art approaches have focused on the recognition of entities that are explicitly mentioned in a sentence. However, clinical documents often contain phrases that indicate the entities but do not contain their names. We term those implicit entity mentions and introduce the problem of implicit entity recognition (IER) in clinical documents. We propose a solution to IER that leverages entity definitions from a knowledge base to create entity models, projects sentences to the entity models and identifies implicit entity mentions by evaluating semantic similarity between sentences and entity models. The evaluation with 857 sentences selected for 8 different entities shows that our algorithm outperforms the most closely related unsupervised solution. The similarity value calculated by our algorithm proved to be an effective feature in a supervised learning setting, helping it to improve over the baselines, and achieving F1 scores of .81 and .73 for different classes of implicit mentions. Our gold standard annotations are made available to encourage further research in the area of IER.",

author = "Sujan Perera and Pablo Mendes and Amit Sheth and Krishnaprasad Thirunarayan and Adarsh Alex and Christopher Heid and Greg Mott",

year = "2015",

doi = "10.18653/v1/s15-1028",

language = "English (US)",

series = "Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015",

publisher = "Association for Computational Linguistics (ACL)",

pages = "228--238",

booktitle = "Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015",

address = "United States",

note = "4th Joint Conference on Lexical and Computational Semantics, *SEM 2015 ; Conference date: 04-06-2015 Through 05-06-2015",

}

TY - GEN

T1 - Implicit entity recognition in clinical documents

AU - Perera, Sujan

AU - Mendes, Pablo

AU - Sheth, Amit

AU - Thirunarayan, Krishnaprasad

AU - Alex, Adarsh

AU - Heid, Christopher

AU - Mott, Greg

PY - 2015

Y1 - 2015

N2 - With the increasing automation of health care information processing, it has become crucial to extract meaningful information from textual notes in electronic medical records. One of the key challenges is to extract and normalize entity mentions. State-of-the-art approaches have focused on the recognition of entities that are explicitly mentioned in a sentence. However, clinical documents often contain phrases that indicate the entities but do not contain their names. We term those implicit entity mentions and introduce the problem of implicit entity recognition (IER) in clinical documents. We propose a solution to IER that leverages entity definitions from a knowledge base to create entity models, projects sentences to the entity models and identifies implicit entity mentions by evaluating semantic similarity between sentences and entity models. The evaluation with 857 sentences selected for 8 different entities shows that our algorithm outperforms the most closely related unsupervised solution. The similarity value calculated by our algorithm proved to be an effective feature in a supervised learning setting, helping it to improve over the baselines, and achieving F1 scores of .81 and .73 for different classes of implicit mentions. Our gold standard annotations are made available to encourage further research in the area of IER.

AB - With the increasing automation of health care information processing, it has become crucial to extract meaningful information from textual notes in electronic medical records. One of the key challenges is to extract and normalize entity mentions. State-of-the-art approaches have focused on the recognition of entities that are explicitly mentioned in a sentence. However, clinical documents often contain phrases that indicate the entities but do not contain their names. We term those implicit entity mentions and introduce the problem of implicit entity recognition (IER) in clinical documents. We propose a solution to IER that leverages entity definitions from a knowledge base to create entity models, projects sentences to the entity models and identifies implicit entity mentions by evaluating semantic similarity between sentences and entity models. The evaluation with 857 sentences selected for 8 different entities shows that our algorithm outperforms the most closely related unsupervised solution. The similarity value calculated by our algorithm proved to be an effective feature in a supervised learning setting, helping it to improve over the baselines, and achieving F1 scores of .81 and .73 for different classes of implicit mentions. Our gold standard annotations are made available to encourage further research in the area of IER.

UR - http://www.scopus.com/inward/record.url?scp=84966374220&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84966374220&partnerID=8YFLogxK

U2 - 10.18653/v1/s15-1028

DO - 10.18653/v1/s15-1028

M3 - Conference contribution

AN - SCOPUS:84966374220

T3 - Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015

SP - 228

EP - 238

BT - Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015

PB - Association for Computational Linguistics (ACL)

T2 - 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015

Y2 - 4 June 2015 through 5 June 2015

ER -

Implicit entity recognition in clinical documents

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this