Skip to content

piratos/huggingface-vscode-endpoint-server

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hugging Face VSCode Endpoint Server

starcoder server for huggingface-vscdoe custom endpoint.

Can't handle distributed inference very well yet.

Fork

This fork:

  • Refactor the generator codes to separate classes
  • Adds support for starcoder under ct2fast conversion for faster inference on consumer hardware
  • Has a support vs code extension for triggered code completion see vstarcoder

PS: Rationale for not using huggingface-vscode explained in vstarcoder extension readme

Usage

pip install -r requirements.txt
python main.py

Fill http://localhost:8000/api/generate/ into Hugging Face Code > Model ID or Endpoint in VSCode.

API

curl -X POST http://localhost:8000/api/generate/ -d '{"inputs": "", "parameters": {"max_new_tokens": 64}}'
# response = {"generated_text": ""}

About

starcoder server for huggingface-vscdoe custom endpoint (Ct2fast version)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%