LangChain is a library that facilitates the development of applications by leveraging large language models (LLMs) and enabling their composition with other sources of computation or knowledge. FastChat’s OpenAI-compatible API server enables using LangChain with open models seamlessly.
--model-path
can be a local folder or a Hugging Face repo name.
See a full list of supported models here.
export FASTCHAT_WORKER_API_EMBEDDING_BATCH_SIZE=1
.