This directory contains the code and dataset release for *DiscussLLM: Teaching Large Language Models When to Speak*.
Repository layout:

- src/discussllm: dataset loaders and collators used for generator and classifier fine-tuning
- scripts: minimal training and turn-by-turn inference entry points
- data_generation: prompts and local generation utilities
- configs/deepspeed: DeepSpeed configs used by the training scripts
- hf_dataset: Hugging Face dataset package with group discussions and split files
Setup:

mamba env create -f discussllm.yml
mamba activate discussllm
pip install -e .

Clone the dataset into the root of the repo.
The raw discussion transcripts are in:
hf_dataset/data/generated_discussion_data
Splits:
hf_dataset/data/train_data.txt
hf_dataset/data/test_data.txt
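
The split files under hf_dataset/data are plain text. As a quick way to inspect them, the sketch below assumes each split file lists one transcript identifier per line; that format is an assumption here, not something this README documents.

```python
from pathlib import Path

data_root = Path("hf_dataset/data")

def read_split(path: Path) -> list[str]:
    # Assumed format: one transcript identifier (or filename) per non-empty line.
    return [line.strip() for line in path.read_text().splitlines() if line.strip()]

train_ids = read_split(data_root / "train_data.txt")
test_ids = read_split(data_root / "test_data.txt")
print(f"{len(train_ids)} train / {len(test_ids)} test entries")
```
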
Train the generator:

deepspeed scripts/train_generator.py \
--deepspeed configs/deepspeed/zero1.json \
--model_name_or_path meta-llama/Meta-Llama-3-8B-Instruct \
--data_root hf_dataset/data/generated_discussion_data \
--output_dir outputs/generator \
--num_train_epochs 5 \
--per_device_train_batch_size 1 \
--per_device_eval_batch_size 1 \
--gradient_accumulation_steps 8 \
--gradient_checkpointing \
--learning_rate 0.0002 \
--bf16 \
--tf32 \
--use_lora
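
The --use_lora flag suggests the generator is fine-tuned with LoRA adapters via the peft library rather than full-parameter updates. A minimal sketch of what that usually involves is below; the rank, alpha, and target modules are illustrative assumptions, not the values used by scripts/train_generator.py.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

# Illustrative LoRA hyperparameters; the repo's actual settings may differ.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights remain trainable
```
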
Train the classifier:

deepspeed scripts/train_classifier.py \
--deepspeed configs/deepspeed/zero1.json \
--model_name_or_path roberta-base \
--data_root hf_dataset/data/generated_discussion_data \
--output_dir outputs/classifier \
--num_train_epochs 5 \
--per_device_train_batch_size 32 \
--per_device_eval_batch_size 32 \
--learning_rate 0.00001
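
Both commands point at configs/deepspeed/zero1.json; the file shipped in the repo is authoritative. For orientation, a minimal ZeRO stage 1 config that defers batch size and precision settings to the Hugging Face Trainer arguments typically looks like this sketch:

```json
{
  "zero_optimization": { "stage": 1 },
  "bf16": { "enabled": "auto" },
  "train_micro_batch_size_per_gpu": "auto",
  "gradient_accumulation_steps": "auto",
  "train_batch_size": "auto"
}
```
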
The same commands are available as:

bash scripts/train_generator_deepspeed.sh
bash scripts/train_classifier_deepspeed.sh

Zero-shot evaluation:
bash scripts/eval_zeroshot.sh

Inference with the fine-tuned models:

- End to end:
python scripts/infer_generator.py \
--base_model_name_or_path meta-llama/Meta-Llama-3-8B-Instruct \
--model_name_or_path outputs/generator

- Classifier:
python scripts/infer_classifier.py --model_name_or_path outputs/classifier
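
Both commands above wrap the underlying Hugging Face models. For a rough picture of how the two pieces fit together (the classifier decides when to speak, the generator produces the turn), here is a sketch that loads both and runs a single decide-then-respond step. The label convention, the assumption that outputs/generator is a LoRA/PEFT checkpoint, and the prompt handling are all assumptions here; the authoritative logic lives in scripts/infer_classifier.py and scripts/infer_generator.py.

```python
import torch
from peft import PeftModel  # assumes outputs/generator is a PEFT/LoRA checkpoint
from transformers import (
    AutoModelForCausalLM,
    AutoModelForSequenceClassification,
    AutoTokenizer,
)

BASE = "meta-llama/Meta-Llama-3-8B-Instruct"

# Classifier: decides whether the model should speak at this point in the discussion.
clf_tok = AutoTokenizer.from_pretrained("outputs/classifier")
clf = AutoModelForSequenceClassification.from_pretrained("outputs/classifier").eval()

# Generator: base model with the fine-tuned adapter attached.
gen_tok = AutoTokenizer.from_pretrained(BASE)
gen = PeftModel.from_pretrained(
    AutoModelForCausalLM.from_pretrained(BASE, torch_dtype=torch.bfloat16, device_map="auto"),
    "outputs/generator",
).eval()

def step(discussion_so_far: str) -> str | None:
    """Return a generated turn if the classifier says to speak, otherwise None."""
    enc = clf_tok(discussion_so_far, return_tensors="pt", truncation=True)
    with torch.no_grad():
        speak = clf(**enc).logits.argmax(dim=-1).item() == 1  # assumed label: 1 = speak
    if not speak:
        return None
    prompt = gen_tok(discussion_so_far, return_tensors="pt").to(gen.device)
    with torch.no_grad():
        out = gen.generate(**prompt, max_new_tokens=128, do_sample=False)
    return gen_tok.decode(out[0][prompt["input_ids"].shape[-1]:], skip_special_tokens=True)
```
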