sudip10028/knowledge-aware-med-classification

 
 


knowledge-aware-med-classification

Contains the codebase for our paper "Developing Knowledge-aware Neural Models for Medical Forum Question Classification", submitted to the Web Conference 2021.

Proposed Knowledge-aware BERT model

Running BERT-plus models

The code is available as self-contained Jupyter notebooks at: src/ProposedKnowledgeAwareModel/

Experiments and Results

The BERT and MedBERT models were trained and evaluated on three datasets: CADEC, ICHI and OHSUMED (the datasets are provided in the data directory). We compared the accuracy of the baseline models and our proposed models on the ICHI dataset. TextGCN reached an accuracy of 0.68, below both BERT (0.695) and MedBERT (0.706).
We also ran several interpretability analyses on the baseline models. LIME and SHAP were applied to logistic regression models to find the words that most affect the predictions. Similarly, we used Hierarchical Attention Networks to extract word-level and sentence-level attention weights. These analyses show to what degree the models capture medical words.
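
To give a feel for what a word-level attribution on a linear model looks like, here is a minimal sketch in plain Python. It is not the code from the notebooks and it does not call the LIME or SHAP libraries. It uses the simpler closed form that holds for a linear model when features are treated as independent: each word contributes its coefficient times the deviation of its feature value from the dataset mean. The vocabulary, weights, bias and word counts below are all made up for illustration.

```python
from typing import Dict, List

BagOfWords = Dict[str, float]


def logit(weights: Dict[str, float], bias: float, x: BagOfWords) -> float:
    """Raw score of a linear model (before the sigmoid)."""
    return bias + sum(w * x.get(word, 0.0) for word, w in weights.items())


def linear_attributions(
    weights: Dict[str, float], x: BagOfWords, background: List[BagOfWords]
) -> Dict[str, float]:
    """Per-word contribution to the logit: w_i * (x_i - mean_i).

    mean_i is the average value of feature i over the background set.
    """
    n = len(background)
    attributions = {}
    for word, w in weights.items():
        mean = sum(row.get(word, 0.0) for row in background) / n
        attributions[word] = w * (x.get(word, 0.0) - mean)
    return attributions


# Hypothetical model and data, for illustration only.
weights = {"headache": 1.5, "dosage": -0.5, "the": 0.0}
bias = -0.25
background = [
    {"headache": 1.0, "the": 2.0},
    {"dosage": 1.0, "the": 1.0},
    {"the": 3.0},
    {"headache": 1.0, "dosage": 1.0, "the": 2.0},
]
question = {"headache": 2.0, "the": 1.0}

attr = linear_attributions(weights, question, background)

# Rank words by the magnitude of their contribution.
ranked = sorted(attr.items(), key=lambda kv: abs(kv[1]), reverse=True)
for word, value in ranked:
    print(f"{word:10s} {value:+.2f}")
```

The attributions sum to the difference between the score of the explained question and the average score over the background set, which makes it easy to check whether medical words or function words carry the prediction.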

Annotated CADEC dataset

We annotate CADEC as a multi-label multi-class dataset for the task of "Medical Forum Question Classification". Each data point is annotated with 0 or 1 for each of five information need categories. An additional column contains the medical concepts extracted using the QuickUMLS tool. The annotated files can be found at CADEC-Annotations/
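
The snippet below sketches how annotations of this shape could be read into multi-label vectors. The file layout is an assumption: the column names (`text`, `cat_1` to `cat_5`, `concepts`), the comma delimiter, the `;` separator between concepts and the two sample rows are hypothetical and may not match the actual files in CADEC-Annotations/.

```python
import csv
import io

# Hypothetical sample; the real files may use different columns and separators.
SAMPLE = """text,cat_1,cat_2,cat_3,cat_4,cat_5,concepts
"Is nausea a common side effect?",1,0,0,1,0,nausea;side effect
"What dose should I take?",0,1,0,0,0,dose
"""

# Placeholder names for the five information need categories.
LABEL_COLUMNS = ["cat_1", "cat_2", "cat_3", "cat_4", "cat_5"]


def load_annotations(handle):
    """Read rows into dicts holding the text, a 0/1 label vector and concepts."""
    rows = []
    for record in csv.DictReader(handle):
        labels = [int(record[column]) for column in LABEL_COLUMNS]
        if any(value not in (0, 1) for value in labels):
            raise ValueError(f"labels must be 0 or 1, got {labels}")
        concepts = [c.strip() for c in record["concepts"].split(";") if c.strip()]
        rows.append({"text": record["text"], "labels": labels, "concepts": concepts})
    return rows


examples = load_annotations(io.StringIO(SAMPLE))

# Number of positive examples per category; a row may count toward several.
label_counts = [
    sum(example["labels"][i] for example in examples)
    for i in range(len(LABEL_COLUMNS))
]
print(label_counts)
```

Because a question can be positive for more than one category, each row keeps a full 0/1 vector rather than a single class index.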
