Private by default
Keep your work on your machine. Local models let prompts, responses, and files be processed on-device instead of being sent to a remote provider.
Run supported GGUF models with your files in a native workspace. Chats, attachments, downloaded local models, and settings stay on your device.
Aitopus supports compatible GGUF models from these families, so private requests can stay on-device when a model is installed.
Keep your work on your machine. Local models let prompts, responses, and files be processed on-device instead of being sent to a remote provider.
Once downloaded, local models are available without an internet connection. Use AI while traveling or working from restricted networks.
Run local models on your own hardware without per-token charges, API keys, or cloud billing for local chats.
Choose which models run on your device and keep your conversations on your machine.
Install, use, and delete local models from your device. Stay in control of storage, model choice, and your local setup.
Browse compatible local models in Aitopus and compare each model's strengths, size, and fit for your device.
Flagship reasoning model for coding and multilingual chat.
Dense Google model with vision and broad language support.
Lightweight model for quick local chat and reasoning.
MoE model from Z.ai for reasoning, coding, and bilingual chat.
Many local models are offered in multiple quantizations. Smaller quantizations use less memory and storage, and usually run faster. Larger ones may preserve more quality, but need more capable hardware. Aitopus helps you pick the version that fits your device.
Once the download finishes, the model appears in Aitopus alongside your other available models. Select it from the model picker to start chatting.
Use the Aitopus chat interface with a model running on your own machine.
This chat is routed to the local model selected above.
See which local models are installed, how much space they use, and remove them whenever you need the space back.
Choose how much conversation history the model can use.
Choose how much conversation history the model can use.