# View help
python main.py --help
# Run dataset with custom query type
python main.py --dataset lcr --query-type SFW

Data should be saved in JSON format. Text is written directly to the JSON files, while other multimodal data (such as images) is stored as file paths. Column names can be modified as needed.
{
"id": "1",
"description": "your-text",
"image": "your-path-to-image"
}

Queries should be saved as query/{dataset}/{query-type}/x.sql
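
As a minimal sketch of the layout described above, the snippet below builds one record with the same three columns as the example and derives the location of a query file. The helper names `make_record` and `query_path` are hypothetical and not part of this project.

```python
import json
from pathlib import Path


def make_record(record_id: str, description: str, image_path: str) -> dict:
    """Build one record: text is stored inline, the image only as a path."""
    return {"id": record_id, "description": description, "image": image_path}


def query_path(dataset: str, query_type: str, name: str) -> Path:
    """Return query/{dataset}/{query-type}/{name}.sql (hypothetical helper)."""
    return Path("query") / dataset / query_type / f"{name}.sql"


if __name__ == "__main__":
    record = make_record("1", "your-text", "your-path-to-image")
    print(json.dumps(record, indent=2))
    print(query_path("lcr", "SFW", "x").as_posix())
```

Column names are only an example here; since they can be modified as needed, the keys in `make_record` would change accordingly.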
- --dataset, -d: Specify the dataset to run
- --all, -a: Run all datasets
- --query-type, -q: Override query type
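
For readers who want to see how such flags are typically wired up, here is a sketch using the standard `argparse` module. It mirrors the three options listed above but is not the project's actual `main.py`; the function name `build_parser` and the treatment of `--all` as a boolean switch are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Assumed parser layout; the real main.py may differ."""
    parser = argparse.ArgumentParser(description="Run queries against a dataset")
    parser.add_argument("--dataset", "-d", help="Specify the dataset to run")
    # Assumption: --all is a flag that takes no value.
    parser.add_argument("--all", "-a", action="store_true", help="Run all datasets")
    parser.add_argument("--query-type", "-q", help="Override query type")
    return parser


if __name__ == "__main__":
    # Equivalent to: python main.py -d lcr -q SFW
    args = build_parser().parse_args(["-d", "lcr", "-q", "SFW"])
    print(args.dataset, args.query_type, args.all)
```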
Main UQE configuration parameters:
- USE_BART: Whether to use the BART model
- BATCH_SIZE: Batch size
- BUDGET: Budget limit
- AGGR_STRATEGY: Aggregation strategy
- N_CENTROIDS: Number of clustering centroids
- N_ITER: Number of iterations
- GROUP_EXTRACT_SAMPLE_RATIO: Group extraction sampling ratio
- AGGR_CLUSTER_SAMPLE_RATIO: Aggregation cluster sampling ratio
OpenAI connection configuration:
- MODEL: Selected OpenAI model
- OPENAI_KEY: OpenAI API key
- BASE_URL: OpenAI base URL
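
One way to keep the API key out of version control is to read the three connection settings from environment variables. The sketch below assumes that approach; the project itself may simply define these names as constants in a configuration file, and `OpenAIConfig` and `load_openai_config` are hypothetical names.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class OpenAIConfig:
    model: str
    openai_key: str
    base_url: str


def load_openai_config(env=None) -> OpenAIConfig:
    """Read MODEL, OPENAI_KEY and BASE_URL from a mapping (default: os.environ)."""
    env = os.environ if env is None else env
    missing = [name for name in ("MODEL", "OPENAI_KEY", "BASE_URL") if not env.get(name)]
    if missing:
        raise KeyError(f"missing configuration: {', '.join(missing)}")
    return OpenAIConfig(env["MODEL"], env["OPENAI_KEY"], env["BASE_URL"])


if __name__ == "__main__":
    # Placeholder values for illustration only.
    demo = {"MODEL": "your-model", "OPENAI_KEY": "your-key", "BASE_URL": "your-base-url"}
    print(load_openai_config(demo).model)
```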
- Results are saved in the result/{dataset_name}/{query_type}/{timestamp}/ directory
- Each query result is saved as a CSV file
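
The output layout above can be sketched as follows. The timestamp format (`%Y%m%d_%H%M%S`), the CSV file naming, and the helper names `result_dir` and `save_query_result` are assumptions for illustration; the project may use different conventions.

```python
import csv
import tempfile
from datetime import datetime
from pathlib import Path


def result_dir(dataset_name, query_type, root="result", now=None) -> Path:
    """Return {root}/{dataset_name}/{query_type}/{timestamp}."""
    # Assumption: the timestamp is formatted as YYYYMMDD_HHMMSS.
    stamp = (now or datetime.now()).strftime("%Y%m%d_%H%M%S")
    return Path(root) / dataset_name / query_type / stamp


def save_query_result(out_dir: Path, query_name: str, header, rows) -> Path:
    """Write one query's rows to {out_dir}/{query_name}.csv and return the path."""
    out_dir.mkdir(parents=True, exist_ok=True)
    csv_path = out_dir / f"{query_name}.csv"
    with csv_path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
    return csv_path


if __name__ == "__main__":
    # Write into a temporary root so the demo leaves nothing behind.
    with tempfile.TemporaryDirectory() as tmp:
        out = result_dir("lcr", "SFW", root=tmp)
        print(save_query_result(out, "x", ["id", "description"], [["1", "your-text"]]))
```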