Commit bfbb3a2

Merge branch 'dev' into data-review-tool-jenit
2 parents afafe34 + 45f5e8a commit bfbb3a2

42 files changed

Lines changed: 1666 additions & 1007 deletions

.github/workflows/pull-request-testing.yml

Lines changed: 1 addition & 1 deletion
@@ -35,4 +35,4 @@ jobs:
         flake8 . --count --exit-zero --max-complexity=10 --max-line-length=127 --statistics
     - name: Test with pytest
       run: |
-        pytest
+        pytest --cov=src -v

CODE_OF_CONDUCT.md

Lines changed: 1 addition & 1 deletion
@@ -34,7 +34,7 @@ This Code of Conduct applies both within project spaces and in public spaces whe
 
 ## Enforcement
 
-Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by contacting the project team. The project team will review and investigate all complaints, and will respond in a way that it deems appropriate to the circumstances. The project team is obligated to maintain confidentiality with regard to the reporter of an incident. Further details of specific enforcement policies may be posted separately.
+Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by contacting the project team at [email protected]. The project team will review and investigate all complaints, and will respond in a way that it deems appropriate to the circumstances. The project team is obligated to maintain confidentiality with regard to the reporter of an incident. Further details of specific enforcement policies may be posted separately.
 
 Project maintainers who do not follow or enforce the Code of Conduct in good faith may face temporary or permanent repercussions as determined by other members of the project's leadership.
 

docker/data-review-tool/Dockerfile

Lines changed: 2 additions & 0 deletions
@@ -8,6 +8,8 @@ COPY ./docker/data-review-tool/requirements.txt .
 RUN pip install --no-cache-dir -r requirements.txt
 
 WORKDIR MetaExtractor/
+
+ENV LOG_LEVEL=DEBUG
 # Copy the entire repository folder into the container
 COPY src ./src
 

requirements.txt

Lines changed: 34 additions & 12 deletions
@@ -1,19 +1,41 @@
+
 # python version 3.10
-pandas==2.0.1
-pytest~=7.3
-seaborn~=0.12
-plotly~=5.14
-seqeval==1.2.2
-nltk==3.8.1
-spacy==3.5.3
+dash~=2.10
+dash_bootstrap_components~=1.4
+dash-testing-stub==0.0.2
+dash_iconify==0.1.2
+dash_mantine_components==0.12.1
+dash_player==1.1.0
+datasets==2.13.1
+docopt-ng~=0.8
+evaluate==0.4.0
+flake8~=6.0
+huggingface_hub==0.15.1
+joblib==1.2.0
+langdetect==1.0.9
+matplotlib~=3.7
+mlflow~=2.4
+mlflow_skinny~=2.3
 nbformat~=5.8
-scikit-learn~=1.2
+nltk~=3.8
+numpy~=1.24
+pandas~=2.0
+percy==2.0.2
+plotly~=5.14
+psutil~=5.9
+pyarrow~=12.0
+pytest~=7.3
 pytest-cov~=4.0
-flake8~=6.0
-docopt-ng~=0.8
 python-dotenv~=1.0
-transformers~=4.24
+requests~=2.31
+scikit-learn~=1.2
+seaborn~=0.12
+selenium~=4.2.0
+sentence_transformers~=2.2
+seqeval==1.2.2
+spacy~=3.5
 torch~=1.12
-pyarrow~=12.0
+tqdm~=4.65
+transformers~=4.28
 # to use the spacy model for baseline NER
 https://github.com/explosion/spacy-models/releases/download/en_core_web_lg-3.5.0/en_core_web_lg-3.5.0-py3-none-any.whl
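
Note on the specifiers above: several pins move from `==` to `~=`, the PEP 440 "compatible release" operator. `~=2.10` accepts any `2.x` at or above `2.10`, while a three-part spec such as `selenium~=4.2.0` only accepts `4.2.x`. The sketch below approximates that rule for plain dotted numeric versions; `is_compatible` is a hypothetical helper written for illustration, not part of pip or of this repository, and it ignores pre-releases, epochs, and local version labels.

```python
def _parse(version):
    """Split a plain dotted version such as "2.10.1" into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def is_compatible(spec_version, candidate):
    """Approximate PEP 440 `~=` for plain dotted numeric versions only."""
    spec = _parse(spec_version)
    cand = _parse(candidate)
    if len(spec) < 2:
        raise ValueError("~= needs at least two release segments")
    # Everything except the last segment of the spec must match exactly.
    prefix = spec[:-1]
    # Pad with zeros so that "2.10" and "2.10.0" compare as equal.
    width = max(len(spec), len(cand))
    spec_padded = spec + (0,) * (width - len(spec))
    cand_padded = cand + (0,) * (width - len(cand))
    return cand_padded >= spec_padded and cand_padded[: len(prefix)] == prefix


print(is_compatible("2.10", "2.14.1"))   # True: dash~=2.10 allows later 2.x releases
print(is_compatible("2.10", "3.0"))      # False: a new major version is excluded
print(is_compatible("4.2.0", "4.3.0"))   # False: selenium~=4.2.0 stays within 4.2.x
```

The practical consequence is that `selenium~=4.2.0` is a much tighter constraint than the two-part specifiers around it, which may or may not be intentional.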

sample_pipeline_output.json

Lines changed: 0 additions & 166 deletions
This file was deleted.

src/data_review_tool/app.py

Lines changed: 10 additions & 2 deletions
@@ -1,18 +1,26 @@
+# Author: Shaun Hutchinson, Jenit Jain
+# Date: 2023-06-22
 import dash
 from dash import dcc, html
 import dash_bootstrap_components as dbc
 import os
 import zipfile
+import sys
 
-from pages.navbar import create_navbar
+sys.path.append(os.path.join(os.path.dirname(__file__), "..", ".."))
 
+from src.data_review_tool.pages.navbar import create_navbar
+
+from src.logs import get_logger
+
+logger = get_logger(__name__)
 
 app = dash.Dash(
     __name__,
     use_pages=True,
     external_stylesheets=[
         dbc.themes.BOOTSTRAP,
-        "src/data_review_tool/assets/styles.css",
+        os.path.join("src", "data_review_tool", "assets", "styles.css"),
     ],
     title="Finding Fossils",
     suppress_callback_exceptions=True,
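
Note on logging: this hunk starts importing `get_logger` from `src.logs`, and the Dockerfile hunk earlier in the commit sets `ENV LOG_LEVEL=DEBUG`. The body of `src.logs` is not visible in this part of the diff, so the following is only a guess at how such a helper could tie the two together. The function body, the `INFO` fallback, and the log format are all assumptions.

```python
import logging
import os


def get_logger(name):
    """Hypothetical stand-in for src.logs.get_logger (the real one is not shown here)."""
    # Assumption: the level is read from the LOG_LEVEL environment variable,
    # falling back to INFO when it is unset or not a recognised level name.
    level_name = os.environ.get("LOG_LEVEL", "INFO").upper()
    level = getattr(logging, level_name, None)
    if not isinstance(level, int):
        level = logging.INFO

    logger = logging.getLogger(name)
    logger.setLevel(level)
    # Attach a handler only once so repeated calls do not duplicate output.
    if not logger.handlers:
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(name)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

If the real helper works along these lines, the new `ENV` line would make the container log at debug verbosity by default, which is worth confirming before the image is used outside development.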

src/data_review_tool/pages/about.py

Lines changed: 8 additions & 3 deletions
@@ -1,13 +1,18 @@
+# Author: Shaun Hutchinson,
+# Date: 2023-06-22
 import dash
 import sys
 import os
-from pages.config import *
+from dash import html
+import os
 import dash_player as dp
 
+sys.path.append(os.path.join(os.path.dirname(__file__), "..", ".."))
+
+from src.data_review_tool.pages.config import *
+
 dash.register_page(__name__)
 
-from dash import dash, dcc, html, Input, Output, callback
-import os
 
 layout = html.Div(
     [
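
Note on the `sys.path.append` lines: both `app.py` and `pages/about.py` append the same expression, `os.path.join(os.path.dirname(__file__), "..", "..")`, but the two files sit at different depths, so the expression resolves to different directories. The sketch below works this out with `posixpath` so the result is the same on any platform. The `/repo` prefix is a made-up checkout location and `appended_path` is a hypothetical helper; only the relative layout comes from the diff.

```python
import posixpath


def appended_path(module_file):
    """Resolve the directory that dirname(__file__)/../.. points to."""
    # posixpath keeps the result platform-independent for this illustration.
    return posixpath.normpath(
        posixpath.join(posixpath.dirname(module_file), "..", "..")
    )


# Hypothetical checkout location; only the relative layout is taken from the diff.
print(appended_path("/repo/src/data_review_tool/app.py"))          # /repo
print(appended_path("/repo/src/data_review_tool/pages/about.py"))  # /repo/src
```

From `app.py` the appended directory is the repository root, which is what makes `src.data_review_tool...` importable. From `pages/about.py` it is `src`, which does not by itself expose a top-level `src` package. The `from src.data_review_tool.pages.config import *` line in `about.py` therefore probably depends on the root already being on `sys.path`, for example through the append in `app.py` or the working directory. That is an inference from the paths, not something the commit states.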
