Web2Code

Official implementation of A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs.

Institute: Mohamed bin Zayed University of Artificial Intelligence
Resources: [Paper] [Project Page] [Web2Code Dataset][Croissant]

News

[2024/6/27] The paper and project page are released!

Evaluation Benchmark Suite

Explore our comprehensive benchmarks for evaluating webpage-related tasks.

Webpage Code Generation Benchmark

Set up your environment, generate webpage screenshots, and run evaluations efficiently. Get started here: Webpage Code Generation Benchmark

Webpage Understanding Benchmark

Find clear instructions for setting up your environment, generating outputs, and running evaluations. Begin here: Webpage Understanding Benchmark

Acknowledgments

LLaVA: the codebase we built upon. Thanks for their wonderful work.
WebSRC, WebSight, Pix2Code: some high-quality web page and HTML code related dataset！

Citation

If you find our work helpful for your research, please consider giving a star ⭐ and citation 📝

@article{web2code2024,
  title={Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs},
  journal={arXiv preprint},
  year={2024}
}

License

Usage and License Notices: Usage and License Notices: The data is intended and licensed for research use only. The dataset is CC BY 4.0 (allowing only non-commercial use) and models trained using the dataset should not be used outside of research purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
assets		assets
code_generation		code_generation
webpage_understanding		webpage_understanding
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web2Code

News

Evaluation Benchmark Suite

Webpage Code Generation Benchmark

Webpage Understanding Benchmark

Acknowledgments

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Web2Code

News

Evaluation Benchmark Suite

Webpage Code Generation Benchmark

Webpage Understanding Benchmark

Acknowledgments

Citation

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages