Skip to content

Latest commit

 

History

History
22 lines (18 loc) · 1.15 KB

File metadata and controls

22 lines (18 loc) · 1.15 KB

Text Mining

HarvestText

  • link: https://github.com/blmoistawinde/HarvestText
  • author: blmoistawinde
  • note: 一个专注无(弱)监督方法,能够整合领域知识(如类型,别名)对特定领域文本进行简单高效地处理和分析的库。

Open Semantic Search

  • link: https://github.com/opensemanticsearch
  • web: https://www.opensemanticsearch.org/
  • author: opensemanticsearch.org
  • note: Free Software for your own Search Engine, Explorer for Discovery of large document collections, Media Monitoring, Text Analytics, Document Analysis & Text Mining platform based on Apache Solr or Elasticsearch open-source enterprise-search and Open Standards for Linked Data, Semantic Web & Linked Open Data integration.

DocQuery

Open Parse

  • link: https://github.com/Filimoa/open-parse
  • author: Filimoa
  • note: a tool designed to fill this gap by providing a flexible, easy-to-use library capable of visually discerning document layouts and chunking them effectively.