Put fullData.json at the root directory, along with the scripts Then run docMatrix.py to generate docs.json, which is the json file containing the tokenized abstract data of all files Do not remove the output file, it will be used in the second script Finally, run distMatrix.py to generate dist.json, which is the json file containing the final distance table
JerryKou12138/JaccardDist
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|