Run Llama,Qwen,Gemma,Mistral, any warm/cold LLM. No GPU req.
This is open-o1 demo with improved system prompt
llama.cpp server hosting a reasoning model CPU only.
Interact with a chatbot that searches for information and reasons based on your queries
Chat with a helpful AI assistant in Chinese
Chat with PDF documents using AI
Generate detailed step-by-step answers to questions
Chat with content from any website
Chat about images by uploading them and typing questions
AutoRAG Optimization Web UI
Fast and free uncensored chatbot that just works.
Bored with typical gramatical correct conversations?
Interact with PDFs using a chatbot that understands text and images
Serverless TextGen Hub is a serverless platform designed to run advanced language models such as Llama, Qwen, Gemma, and Mistral. It allows users to deploy and interact with these models without requiring GPU support, making it accessible and cost-effective. The platform is tailored for creating customizable AI assistants that can be integrated into various applications, enabling seamless chatbot functionality and text generation capabilities.
What models are supported by Serverless TextGen Hub?
Serverless TextGen Hub supports a variety of models, including Llama, Qwen, Gemma, Mistral, and other warm/cold LLMs.
Do I need a GPU to run Serverless TextGen Hub?
No, Serverless TextGen Hub is designed to operate without requiring GPU support, making it accessible on standard computing resources.
How do I obtain API keys for the models?
API keys or model access tokens can be obtained from the respective model providers. Follow their instructions to set up and use the keys within Serverless TextGen Hub.