Skip to content

Commit 0e5bc28

Browse files
authored
Update README.md
Improve README in clarifying the role of each notebooks
1 parent b19e0c7 commit 0e5bc28

1 file changed

Lines changed: 6 additions & 6 deletions

File tree

README.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,16 @@
11
# GraphNER
22

3-
This repository contains the preliminary code for reproducing the results reported in the paper "[GraphNER: Named Entity Recognition as Graph Classification](https://openreview.net/forum?id=vfpW-kRvLgu)" (currently under review).
3+
This repository contains the code for reproducing the preliminary results reported in the paper "[Named Entity Recognition as Graph Classification](https://openreview.net/forum?id=QA_Ttxv9WfG)" (currently under review for the [ESWC 2021 Poster Track conference](https://2021.eswc-conferences.org/)).
44

55
## Overview
66
The code is organized as notebooks, to be used as follows:
7-
* `final_generate_gazetteers.ipynb`: to generate the gazeteers from Wikidata (by specifying a list of QIDs corresponding to the classes)
8-
* `edge_list_generation.ipynb`: to generate the graph structure to build the graph embeddings
9-
* `graph_embeddings_generation.ipynb`: to generate node embeddings using [GEM library](https://github.com/palash1992/GEM) algorithms (e.g. node2ve, SDNE..)
7+
* `final_generate_gazetteers.ipynb`: to generate gazeteers from Wikidata (by specifying a list of QIDs corresponding to the entity types that one wishes to extract)
8+
* `edge_list_generation.ipynb`: to generate the graph structure to build the graph embeddings; when applied to the [ConLL 2003 train dataset](https://www.clips.uantwerpen.be/conll2003/ner/), one should get a similar result that this [Python dict data structure](https://github.com/Siliam/graph_ner/blob/main/dataset/conll/conll_graph.pickle)
9+
* `graph_embeddings_generation.ipynb`: to generate node embeddings using of the algorithms (e.g. node2ve, SDNE..) provided by the [GEM library](https://github.com/palash1992/GEM)
1010
* `nodes_classifier.ipynb`: to train a model for the node embeddings
1111
* `autoencoder_embeddings.ipynb`: to generate auto-encoder embeddings from the binary graph representations
1212
* `autoencoder_nn_classification.ipynb`: to train a model for the auto-encoder embeddings
13-
* `GCN-node-simple-features.ipynb`: to train a GCN on the CoNLL-2003 task
13+
* `GCN-node-simple-features.ipynb`: to train a GCN on the CoNLL-2003 dataset
1414

1515
The code will be streamlined into stand-alone configurable scripts and fully documented soon.
1616

@@ -27,7 +27,7 @@ The code will be streamlined into stand-alone configurable scripts and fully doc
2727
:warning: This code runs on a CUDA11.0-enabled GPU, please install the compatible version of the modules for your hardware.
2828

2929
## Results
30-
The table below shows the performance of different models on the validation set of CoNLL-2003
30+
The table below shows the performance of different models on the validation set (dev) of [CoNLL-2003](https://www.clips.uantwerpen.be/conll2003/ner/)
3131

3232
Method | Accuracy | Micro-F1 | Macro-F1
3333
-----------------|----------|----------|---------

0 commit comments

Comments
 (0)