data

Readme

The datasets in our experiments are derived from two sources:

the raw meta information (e.g., title, review) downloaded from Amazon review.
the preprocessed interactions (i.e., item sequences) obtained from UniSRec.

Please preprocess the reviews and records based on the scripts. Let's take the Office dataset as an example, the preprocessed dataset should be:

Office
├─title_review_summary_descroption
├──test.pkl
├──train.pkl
├──val.pkl
├─negative_title
├──user_item_negitem_nge_title_seq_test.pkl
├──user_item_negitem_nge_title_seq_train.pkl
├──user_item_negitem_nge_title_seq_val.pkl
├─Office_products_5.json
├─meta_Office_Products.json
├─Office_products_5.json
└stmap.pkl

Or you can download the processed datasets from here.

Name		Name	Last commit message	Last commit date
parent directory ..
data_t5_finetune.py		data_t5_finetune.py
meta2stmap.py		meta2stmap.py
readme.md		readme.md
review_seq_process.py		review_seq_process.py
titlerec_negative.py		titlerec_negative.py
titlerec_negative_sample.py		titlerec_negative_sample.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

Readme

FilesExpand file tree

data

Directory actions

More options

Directory actions

More options

Latest commit

History

data

Folders and files

parent directory

readme.md

Readme