Accepted by COLING 2025
We introduce SeeD, a novel and efficient inference framework that jointly optimizes runtime speed and GPU memory management. By employing scheduled speculative execution, SeeD efficiently handles the multiple iterations required for thought generation and state evaluation, using a rounds-scheduled strategy to manage draft-model dispatching.
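The following is a minimal conceptual sketch of what rounds-scheduled dispatch could look like; it is not the repository's implementation, and `propose` and `verify` are hypothetical stand-ins for the actual draft-generation and target-verification calls in `src/`.

```python
# Conceptual sketch only: `propose` and `verify` are hypothetical
# stand-ins for the real draft/verify calls in this repository.
from collections import deque

def rounds_scheduled_decode(draft_models, target_model, num_rounds, gamma):
    """Dispatch draft models in scheduled rounds: each round, every draft
    proposes `gamma` speculative tokens for its thought, then the target
    model verifies all proposals in one batched forward pass."""
    schedule = deque(draft_models)   # round-robin dispatch queue
    results = []
    for _ in range(num_rounds):
        proposals = [draft.propose(gamma) for draft in schedule]
        results.append(target_model.verify(proposals))  # batched verification
        schedule.rotate(-1)          # advance the dispatch schedule
    return results
```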
- `transformers >= 4.35.0`
- Python >= 3.9
- PyTorch >= 1.11.0
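The Python dependencies can be installed with pip, for example (version pins mirror the list above):

```bash
pip install "torch>=1.11.0" "transformers>=4.35.0"
```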
python src/run_example.py
Key Options
--draft_model_path   Path to the draft model.
--target_model_path  Path to the target model.
--tokenizer_path     Path to the tokenizer; if not provided, the draft/target model path is used.
--num_thoughts       Number of draft models / ToT thoughts.
--muti_candidate     Use multi-candidate speculative decoding (MCSD).
--k-config           Candidate configuration as comma-separated values, e.g. `--k-config 2,1,1`.
--replacement        Sample with replacement (False: MCSD, True: SD).
--temperature        Sampling temperature, e.g. 0.0, 0.2, 1.0.
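For example, a run combining these options might look like the following (model paths are placeholders; check `src/run_example.py` for the exact flag semantics):

```bash
python src/run_example.py \
    --draft_model_path /path/to/Llama-68M-Chat \
    --target_model_path /path/to/Llama-2-7b-chat \
    --num_thoughts 3 \
    --muti_candidate \
    --k-config 2,1,1 \
    --temperature 0.2
```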
- Provide a simplified version to help understand the core principle, with the goal of transitioning to general tasks. In this version, the Tree of Thoughts (ToT) has a single root node with a depth of 2, generating 3 thoughts and 3 evaluations at once (draft model: LLAMA-68M-Chat; target model: LLAMA2-7B-Chat). See the sketch after this list.
- Reorganize the code for a better user experience.
- Support other models.
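As a rough illustration of the simplified version described above, the sketch below shows the single-root, depth-2 flow; `generate_batch` and `evaluate_batch` are hypothetical helpers standing in for the batched draft-and-verify calls.

```python
# Hypothetical sketch of the simplified ToT: one root node, depth 2,
# with 3 thoughts drafted and 3 state evaluations done in one batch.
def simplified_tot(root_prompt, generate_batch, evaluate_batch):
    thoughts = generate_batch(root_prompt, n=3)      # 3 thoughts at once
    scores = evaluate_batch(root_prompt, thoughts)   # 3 evaluations at once
    best_score, best_thought = max(zip(scores, thoughts))
    return best_thought
```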
@article{wang2024seed,
  title   = {SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding},
  author  = {Wang, Zhenglin and Wu, Jialong and Lai, Yilong and Zhang, Congzhi and Zhou, Deyu},
  journal = {arXiv preprint arXiv:2406.18200},
  year    = {2024}
}
Thanks to MCSD and Tree of Thoughts (ToT) for providing the open-source code that supported the expansion of this project.