Skip to content
View WeerayutBu's full-sized avatar
😄
😄

Block or report WeerayutBu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
WeerayutBu/README.md

Hi! 👋 I'm Weerayut Buaphet (วีรยุทธ บัวเพชร)

I’m a Ph.D. student in the Natural Language Processing and Representation Learning Lab (NRL) at VISTEC, Thailand, supervised by Associate Professor Prof. Dr. Sarana Nutanong and co-advised by Associate Professor. Prof. Dr. Attapol Rutherford.

PhD Thesis: Resource-Constrained Named Entity Recognition — Contributed a Thai-language dataset for fine-grained nested NER and a bilingual financial NER dataset for the stock market; analyzed the generalization of encoder-based and LLM-based NER models to unseen entity types and new domains; and addressed multilingual text normalization challenges in informal language.

Currently working on LLM-based Retrieval-Augmented Generation (RAG) systems for the medical domain, including RAG pipeline design, evaluation, and post-training optimization of LLMs as part of the ThaiLLM project.

Key Projects

  • Thai Nested NER Corpus (Published in ACL2022, Finding)
  • LLM-Augmented Prototype Representation for Few-shot Named-Entity Recognition (Published in IEEE Access)
  • MultiLexNorm++: A Unified Benchmark and a Generative Model for Lexical Normalization for Asian Languages (Accepted in TALLIP)
  • Bilingual Finance NER Dataset: Cross-lingual dataset for Thai/English (Ongoing)

Contact

🌐 Website
🔗 LinkedIn
📚 Google Scholar
📧 [email protected]


Datasets

Agentic and ​RAG system

Others:

Pinned Loading

  1. vistec-AI/Thai-NNER vistec-AI/Thai-NNER Public

    Pytorch implementation of paper: Thai Nested Named Entity Recognition

    Python 46 7

  2. vistec-AI/Bilingual-Financial-NER-Model vistec-AI/Bilingual-Financial-NER-Model Public

    Python 4

  3. LangChain LangChain Public

    Python

  4. Chat-interface Chat-interface Public

    For RAG system: Streamlit + FastAPI + LangChain.

    Python

  5. Retriever Retriever Public

    FastAPI + LlamaIndex + Reranker + Docker

    Python

  6. MultiLexNorm2026 MultiLexNorm2026 Public

    WNUT workshop: co-located with EMNLP2026

    Jupyter Notebook