A Deep Learning Based Method for Structuring the Chinese Pathological Reports of Lung Specimen

2021 
As a kind of electronic reports in text form, the Chinese pathology report of lung specimen contains a large amount of information that is important for clinicians to further analysis and mining. However, various expressions and no fixed format increases the difficulty of extracting and standardizing this information. In this paper, we focus on the extraction of lung lesion locations and the corresponding diagnosis from these reports. And to overcome the difficulties, a structured processing method based on deep learning and the idea of part-of-speech (POS) tagging was proposed. Firstly, the data of lung pathology specimen reports are preprocessed to normalize the medical terms. Secondly, the bidirectional Long Short-Term Memory (Bi-LSTM) neural network is adopted to extract the information of lesion locations and pathological diagnosis from each report. Finally, the obtained information is screened by an information filter method to generate the final structured results. Experimental results on the self-constructed datasets indicated that the proposed method can be beneficial for structuring pathology reports of lung specimen and obtained state-of-the-art results.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    12
    References
    0
    Citations
    NaN
    KQI
    []
    Baidu
    map