Deploying and Testing Gemma 4 Locally with GPUStack: A Multimodal Agent Capability Guide

The recent release of Gemma 4 introduces models that compete with Qwen 3.5, offering enhanced reasoning, native multi-modal understanding, and agentic features like tool calling and structured output. The model family supports text, image, video, and audio inputs with a 128K-256K context window, depending on the variant. This walkthrough covers ...

Posted on Sat, 13 Jun 2026 17:26:06 +0000 by teongkia

Local Deployment of FastGPT + One-API + ChatGLM3-6B Knowledge Base

1. Local Model Setup Required Repositories & Resources: ChatGLM3-6B code: https://github.com/THUDM/ChatGLM3 FastGPT: https://github.com/labring/FastGPT Vector model M3E (deploy via Docker): https://huggingface.co/moka-ai/m3e-base Anaconda3 installer: https://mirrors.tuna.tsinghua.edu.cn/anaconda/archive/ Create Conda Environment conda cre ...

Posted on Fri, 15 May 2026 00:19:05 +0000 by DaveEverFade

Local API Invocation for ChatGLM3-6B

ChatGLM3-6B Local API Invocation Method Starting the Local ChatGLM3-6B Model 1. Create a New Conda Environment Execute the following commands to create a new Conda environment: conda create -n chatglm3-demo python=3.11 conda activate chatglm3-demo 2. Navigate to the openai_api_demo Module and Run the Following Code: python api_server.py If yo ...

Posted on Sun, 10 May 2026 17:53:22 +0000 by jamesflynn