Android LLM Server
Android LLM Server Summary
Android LLM Server is a mobile Android app in the Tools category by CLOUDSPIRIT LTD. It was released in April 2026 and, based on AppGoblin estimates, has about 6+ installs and roughly 6 monthly active users. Store metadata: updated Apr 25, 2026.
Recent activity: 6 installs this week (6 over the past 4 weeks).
Ratings: 0 (0★)
App Description
Run an AI language model on your device with an OpenAI-compatible local API
Transform your Android device into a powerful local AI server with the Gemma Local API Server. This app allows you to download and host Large Language Models (LLMs) securely on your device and serve them via a local REST API that is compatible with standard OpenAI client libraries.
Whether you're an AI enthusiast, developer, or privacy advocate, you can now enjoy advanced AI completions and embeddings without relying on cloud services. Your data never leaves your device!
Key Features:
100% Offline AI: Run inference entirely on-device to ensure your prompts, data, and responses remain strictly private.
OpenAI API Compatibility: Easily plug this app into your existing LLM pipelines, chat applications, and scripts. The local API supports standard /v1/completions and /v1/embeddings endpoints.
LiteRT / TFLite Support: Engineered for mobile performance, utilizing Android's LiteRT and GPU acceleration for fast, efficient model inference.
Custom Model Management: Download supported models via URL or manage your local .tflite model files effortlessly within the app.
Customizable Settings: Fine-tune the AI output by adjusting generation parameters like Max Tokens, Top K, and Temperature directly from the dashboard.
API Key Rotation: Secure your local server by generating and rotating API access tokens at the tap of a button.
Premium Subscription: Unlock unlimited server uptime. The free version offers limited execution time per day, while the Premium tier removes all restrictions.
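Because the server speaks the OpenAI wire format, a standard-library Python client is enough to talk to it. The sketch below builds a /v1/completions request; the host, port, and API key are placeholders you would copy from the app's dashboard, and the `top_k` field mirrors the app's Top K setting (the standard OpenAI fields are `max_tokens` and `temperature`):

```python
import json
import urllib.request

def build_completion_request(base_url, api_key, prompt,
                             max_tokens=128, temperature=0.7, top_k=40):
    """Build an OpenAI-style /v1/completions request for the local server.

    base_url and api_key are assumptions -- copy the real values from the
    app's dashboard. top_k is a non-standard field assumed to match the
    app's "Top K" generation setting.
    """
    payload = {
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
        "top_k": top_k,
    }
    return urllib.request.Request(
        f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )

# Sending it only works while the app's server is running on your network:
# with urllib.request.urlopen(build_completion_request(
#         "http://192.168.1.50:8080", "YOUR_KEY", "Hello")) as resp:
#     print(json.load(resp)["choices"][0]["text"])
```

Since the request object carries the Bearer token in its headers, the same helper works unchanged after an API key rotation; only the `api_key` argument changes.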
Why use Gemma Local API Server?
Zero Cloud Costs: Pay nothing for inference after downloading the app (or upgrading to premium).
Absolute Privacy: Perfect for sensitive tasks where data cannot be shared over the internet.
Portability: Have your personalized AI model running anywhere, even in a disconnected environment.
Getting Started:
Download a compatible model file or enter a download URL in the app.
Adjust your network settings (Host and Port).
Start the server.
Copy your Local API URL and API Key to your favorite client or terminal and start chatting!
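Once the server is up, the /v1/embeddings endpoint can be exercised the same way as completions. A minimal sketch, again assuming placeholder host, port, and key from the dashboard, and assuming the response follows the OpenAI shape (`{"data": [{"embedding": [...]}, ...]}`):

```python
import json
import urllib.request

def build_embeddings_request(base_url, api_key, texts):
    """Build an OpenAI-style /v1/embeddings request for the local server.

    base_url and api_key are placeholders; texts is a list of strings
    to embed in one batch, as in the standard OpenAI "input" field.
    """
    payload = {"input": texts}
    return urllib.request.Request(
        f"{base_url}/v1/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )

# With the device server running, vectors come back under "data":
# with urllib.request.urlopen(build_embeddings_request(
#         "http://192.168.1.50:8080", "YOUR_KEY", ["local AI"])) as resp:
#     vectors = [item["embedding"] for item in json.load(resp)["data"]]
```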
Note: Model performance depends on the RAM and processor capabilities of your device.
