This Android app controls the screen using commands from vision LLMs.
It doesn't work on Android 14 and below. This may be fixed soon (see below).
Updates on GitHub arrive much faster than on the Play Store and have no restrictions.
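To make the first line concrete, below is a minimal Kotlin sketch of how an accessibility service could execute a tap command of the kind a vision LLM might return. The class name, command format, and coordinates are assumptions for illustration, not necessarily the app's actual code.

```kotlin
// Minimal sketch (assumption: the app executes LLM commands through an
// AccessibilityService; class name and command format are invented here).
import android.accessibilityservice.AccessibilityService
import android.accessibilityservice.GestureDescription
import android.graphics.Path
import android.view.accessibility.AccessibilityEvent

class ScreenOperatorService : AccessibilityService() {

    // Example: the model answered with a command like "tap 540 1200".
    fun executeTapCommand(x: Float, y: Float) {
        val path = Path().apply { moveTo(x, y) }
        val gesture = GestureDescription.Builder()
            .addStroke(GestureDescription.StrokeDescription(path, 0L, 50L))
            .build()
        // dispatchGesture is available from API 24 onwards.
        dispatchGesture(gesture, null, null)
    }

    override fun onAccessibilityEvent(event: AccessibilityEvent?) {}
    override fun onInterrupt() {}
}
```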
This app urgently needs an update: Google's interfaces (APIs) have changed, and it should use an omni model that can actually see the screen content rather than just receive a description of it. However, I am no longer active as a developer. That doesn't mean the project is over, though, because this app can be developed entirely by AI, so anyone can continue its development:
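As a rough idea of what "an omni model that sees the screen" would mean in code, here is a hedged Kotlin sketch that sends a screenshot plus a prompt to the Gemini REST API as an inline image part. The model name, endpoint version, and function are assumptions and may already be outdated; check aistudio.google.com for names that currently work.

```kotlin
// Sketch only (assumptions: Gemini REST API v1beta; the model name may no longer exist).
import android.graphics.Bitmap
import android.util.Base64
import org.json.JSONArray
import org.json.JSONObject
import java.io.ByteArrayOutputStream
import java.net.HttpURLConnection
import java.net.URL

fun askOmniModel(screenshot: Bitmap, prompt: String, apiKey: String): String {
    // Encode the screenshot so the model can actually see the screen content.
    val png = ByteArrayOutputStream()
    screenshot.compress(Bitmap.CompressFormat.PNG, 100, png)
    val imageB64 = Base64.encodeToString(png.toByteArray(), Base64.NO_WRAP)

    // contents -> parts: one text part (the task) and one inline image part (the screen).
    val body = JSONObject().put("contents", JSONArray().put(JSONObject().put("parts", JSONArray()
        .put(JSONObject().put("text", prompt))
        .put(JSONObject()
            .put("inline_data", JSONObject()
                .put("mime_type", "image/png")
                .put("data", imageB64))))))

    val url = URL(
        "https://generativelanguage.googleapis.com/v1beta/models/" +
            "gemini-2.0-flash:generateContent?key=$apiKey"
    )
    val conn = (url.openConnection() as HttpURLConnection).apply {
        requestMethod = "POST"
        setRequestProperty("Content-Type", "application/json")
        doOutput = true
    }
    conn.outputStream.use { it.write(body.toString().toByteArray()) }
    // Call this from a background thread; Android forbids network on the main thread.
    return conn.inputStream.bufferedReader().use { it.readText() }
}
```

The response is JSON containing the model's text, which would then be parsed back into a screen command.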
The code for each application is located in its /app/ folder on GitHub. To have it programmed for free, I use jules.google.com, which you can connect directly to GitHub. Gemini 2.5 Pro is free there, and the limit is so high that I've never reached it. Gemini 3 Pro costs money, but you can use it free for a month and cancel before any charges apply. Definitely use the planning section behind the rocket icon, because AI instructions surprisingly often get implemented differently than intended, and planning improves the performance of LLMs. Since Gemini alone isn't generally very good, my workflow is this: give Jules the task; Jules looks at the code and lists the files it needs to change to achieve the goal. Copy those files into lmarena.ai, select "direct", choose Claude Opus 4.5 (thinking), and paste in the task from Jules. Then paste Claude's output unchanged back into Jules so it can insert the new code snippets into the files; Gemini 2.5 Pro is very good at recognizing where the snippets belong and automatically replaces the corresponding parts.
Jules builds the app using the instructions in the AGENTS.md file and signs it with a test signature. Jules then creates a pull request, and you should even receive an email about it. In the pull request you'll find the built file: click the three dots on the right, then select "View file" and "View raw". Now you can download, install, and test it.
Start a new task in Jules as often as possible, because the longer a task runs, the exponentially slower Jules becomes.
You can also build it with GitHub Actions workflows. Stay in your fork on your own user account (you won't be able to start the workflow otherwise). On mobile, tap the gear icon and then Actions; on desktop, click Actions directly. Open Workflows, select the Android build, and start it on your chosen branch. After about 5 minutes, your app will be ready. If compilation errors occur, copy the output to Jules; with no more than 5 errors, even Gemini 2.5 Pro can usually handle them reliably. Otherwise, proceed as you would when programming: Jules, then Claude, then Jules again. After that, the app needs to be signed. I use MiXplorer and select the test signature option; the other signing options don't always work as well or as quickly. Then you can install it.
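If you would rather have the workflow produce an already-signed APK instead of signing it afterwards with MiXplorer, one option is to let the release build use the standard debug keystore. This is only a sketch in the Gradle Kotlin DSL and assumes a build.gradle.kts module file; the repo's actual build setup may differ.

```kotlin
// build.gradle.kts sketch (assumption: the module uses the Kotlin DSL; adapt to the real files).
android {
    buildTypes {
        getByName("release") {
            // Sign with the auto-generated debug keystore so the CI artifact
            // installs without a separate signing step. Suitable for testing only.
            signingConfig = signingConfigs.getByName("debug")
            isMinifyEnabled = false
        }
    }
}
```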
Some models no longer work because Google keeps changing the interfaces (APIs). You can usually find the current API model names at aistudio.google.com, though you might not find them all there.
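One way to check which model names your API key can still reach is the public ListModels endpoint. The sketch below assumes the v1beta REST API and simply returns the model names.

```kotlin
// Sketch: list the model names currently available to your key (v1beta REST API).
import org.json.JSONObject
import java.net.HttpURLConnection
import java.net.URL

fun listAvailableModels(apiKey: String): List<String> {
    val url = URL("https://generativelanguage.googleapis.com/v1beta/models?key=$apiKey")
    val conn = url.openConnection() as HttpURLConnection
    val json = conn.inputStream.bufferedReader().use { it.readText() }
    val models = JSONObject(json).getJSONArray("models")
    // Entries come back as "models/gemini-...", which is the name to use in request URLs.
    return (0 until models.length()).map { models.getJSONObject(it).getString("name") }
}
```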
In some Android versions, the app exhibits surprising errors. If you find one, please fix it.
Free omni models accessible via an API can be found here.
The very first attempt is recorded.
If your Google account identifies you as under 18, you need an adult account, because Google (unreasonably) denies you the API key.
Preview models will eventually be removed by Google and unfortunately are not redirected to their finished equivalents. If that happens, please change the API name in the code.

