Fine-Tuning Gemma for Chinese Poetry Generation

Welcome to the Gemma Fine-Tuning Project, designed to adapt the Gemma model for generating classical Chinese poetry. This repository documents a technical deep dive into fine-tuning Gemma using LoRA (Low-Rank Adaptation), optimizing for computational efficiency and stylistic accuracy.

Project Overview

Objective

The primary goal of this project is to fine-tune the Gemma model to generate poetry in a classical Chinese style. This involves:

Preparing a tokenized dataset of classical Chinese poetry.
Employing the LoRA framework for lightweight fine-tuning.
Evaluating model performance against stylistic and linguistic metrics.

Key Features

Low Computational Overhead: Leveraging LoRA for efficient model adaptation with minimal hardware requirements.
Cultural Nuance: Tailoring outputs to reflect classical Chinese literary traditions.
Scalable Workflow: Designed for extension to other languages or stylistic domains.

Installation

Ensure you have a compatible Python environment (Python 3.8+ recommended). Install the required libraries by executing:

pip install -q -U keras-nlp datasets
pip install -q -U keras

Dependencies include:

keras-nlp: For natural language processing tasks.
datasets: For efficient dataset loading and management.
keras: For model definition and training.

Dataset

The dataset used in this project is a curated collection of classical Chinese poetry. It has been processed into a tokenized format compatible with the fine-tuning pipeline.

Dataset Preparation

Data Source: Classical Chinese poetry collections from publicly available repositories.
Preprocessing Steps:
- Tokenization using a pre-trained tokenizer.
- Splitting into training, validation, and test sets.
File Structure:
- train.txt: Tokenized training data.
- val.txt: Tokenized validation data.
- test.txt: Tokenized test data for evaluation.

Dataset Statistics

Number of Poems: ~10,000
Average Length: 100 characters per poem
Format: UTF-8 encoded text files

Model Architecture

The Gemma model is based on a transformer architecture with the following characteristics:

Number of Layers: 24
Hidden Size: 1024
Attention Heads: 16
Parameter Count: ~350M

The LoRA approach modifies only a subset of parameters, significantly reducing the computational resources required for fine-tuning.

Fine-Tuning Process

1. Data Preparation

Tokenize input data using the Gemma tokenizer.
Create a PyTorch dataset and dataloader for efficient batching.

2. Training Configuration

Optimizer: AdamW with learning rate scheduling.
Learning Rate: 5e-5 with linear decay.
Batch Size: 16
Epochs: 3
Loss Function: Cross-entropy loss.

3. LoRA Integration

Fine-tune only specific layers of the transformer to reduce resource usage.
Inject trainable low-rank matrices into the attention mechanism.

4. Training Execution

Use the following command to start the fine-tuning process:

from transformers import Trainer, TrainingArguments

training_args = TrainingArguments(
    output_dir="./results",
    evaluation_strategy="steps",
    learning_rate=5e-5,
    per_device_train_batch_size=16,
    num_train_epochs=3,
    weight_decay=0.01,
    logging_dir="./logs",
    save_steps=500,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=val_dataset,
    tokenizer=tokenizer,
)

trainer.train()

Evaluation

Metrics

Perplexity: Measures the model's predictive performance.
BLEU Score: Evaluates similarity to reference poems.
Human Evaluation: Subjective assessment of stylistic fidelity.

Results

Perplexity: Achieved a score of 15.2 on the test set.
BLEU Score: Average BLEU-4 of 0.48.
Human Feedback: Rated highly on maintaining classical style.

Extensibility

This workflow can be extended to:

Other Languages: Adapting Gemma for poetry generation in different linguistic styles.
Stylistic Domains: Training on prose, modern poetry, or technical texts.
Model Variants: Experimenting with larger or smaller variants of Gemma.

Usage

Clone this repository and execute the following steps:

Clone the repository:

git clone https://github.com/your-username/gemma-chinese-poetry.git
cd gemma-chinese-poetry

Install dependencies:
```
pip install -r requirements.txt
```

Run the notebook:

jupyter notebook gemma-google-chinese.ipynb

Fine-tune the model following the notebook instructions.

Limitations

Dataset Bias: Limited to classical Chinese poetry styles.
Computational Requirements: Requires a GPU for efficient training.
Generative Quality: May require manual curation of outputs.

Contributing

We welcome contributions from the community! Here are ways to get involved:

Bug Reports: File an issue for any bugs or inconsistencies.
Feature Requests: Suggest extensions or improvements.
Pull Requests: Submit code or documentation improvements.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

Google: For organizing the competition and supporting innovation.
LoRA Framework: For enabling efficient fine-tuning of large models.
Open-Source Community: For tools and datasets that made this project possible.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
LICENSE.txt		LICENSE.txt
README.md		README.md
gemma-google-chinese.ipynb		gemma-google-chinese.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine-Tuning Gemma for Chinese Poetry Generation

Project Overview

Objective

Key Features

Installation

Dataset

Dataset Preparation

Dataset Statistics

Model Architecture

Fine-Tuning Process

1. Data Preparation

2. Training Configuration

3. LoRA Integration

4. Training Execution

Evaluation

Metrics

Results

Extensibility

Usage

Limitations

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fine-Tuning Gemma for Chinese Poetry Generation

Project Overview

Objective

Key Features

Installation

Dataset

Dataset Preparation

Dataset Statistics

Model Architecture

Fine-Tuning Process

1. Data Preparation

2. Training Configuration

3. LoRA Integration

4. Training Execution

Evaluation

Metrics

Results

Extensibility

Usage

Limitations

Contributing

License

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages