Abstract
The precise named entity recognition (NER) is a key component in Chinese clinical natural language processing. Although clinical NER systems have attracted widespread attention and been studied for decades, the latest NER research usually relies on a shallow text representation with one-layer neural encoding, which fails to capture deep features and limits its performance improvement. To capture more features and encode the clinical text efficiently, we propose a deep stacked neural network for Chinese clinical NER. The neural network stacks two bidirectional long-short term memory and gated recurrent unit layers to encode the text twice, followed by a conditional random fields (CRF) layer to recognize named entities in Chinese clinical text. Extensive empirical results on three real-world datasets demonstrate that the proposed method significantly outperforms six state-of-the-art NER methods. Especially compared with the conventional CRF model, our method has at least 3.75% F1-score improvement on these public datasets.
Original language | English |
---|---|
Article number | e5775 |
Pages (from-to) | 1-11 |
Number of pages | 11 |
Journal | Concurrency Computation Practice and Experience |
Volume | 33 |
Issue number | 22 |
Early online date | 28 Apr 2020 |
DOIs | |
Publication status | Published - 25 Nov 2021 |
Keywords
- Chinese clinical text
- electronic medical record
- GRU
- LSTM
- named entity recognition