Gemma 4 on iPhone

(apps.apple.com)

75 points | by janandonly 1 hour ago

8 comments

pmarreck 40 minutes ago
Impressive model, for sure. I've been running it on my Mac, and now I get to have it locally on my iPhone? I need to test this. Wait, it does agent skills and mobile actions, all local to the phone? Whaaaat? (I'll have to check it out later! Anyone have any tips yet?)
I don't normally do the whole "abliterated" thing (dealignment) but after discovering https://github.com/p-e-w/heretic , I was too tempted to try it with this model a couple days ago (made a repo to make it easier, actually) https://github.com/pmarreck/gemma4-heretical and... Wow. It worked. And... Not having a built-in nanny is fun!
It's also possible to make an MLX version of it, which runs a little faster on Macs, but won't work through Ollama unfortunately. (LM Studio maybe.)
Runs great on my M4 Macbook Pro w/128GB and likely also runs fine under 64GB... smaller memories might require lower quantizations.
I specifically like dealigned local models because if I have to get my thoughts policed when playing in someone else's playground, like hell am I going to be judged while messing around in my own local open-source one too. And there's a whole set of ethically-justifiable but rule-flagging conversations (loosely categorizable as things like "sensitive", "ethically-borderline-but-productive" or "violating sacred cows") that are now possible with this, and at a level never before possible until now.
Note: I tried to hook this one up to OpenClaw and ran into issues.
To answer the obvious question: yes, this sort of thing enables bad actors more (as do many other tools). Fortunately, there are far more good actors out there, and bad actors don't listen to rules that good actors subject themselves to, anyway.
- barbazoo 0 minutes ago
  > And there's a whole set of ethically-justifiable but rule-flagging conversations (loosely categorizable as things like "sensitive", "ethically-borderline-but-productive" or "violating sacred cows") that are now possible with this, and at a level never before possible until now.
  I checked the abliterate script and I don't yet understand what it does or what the result is. What are the conversations this enables?
- magospietato 19 minutes ago
  Haven't built anything on the agent skills platform yet, but it's pretty cool imo.
  On Android the sandbox loads an index.html into a WebView, with standardized string I/O to the harness via some window properties. You can even return a rendered HTML page.
  Definitely hacked together, but feels like an indication of what an edge compute agentic sandbox might look like in future.
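  A minimal sketch of what string I/O over window properties could look like from the page's side. The property names `skillInput` and `skillOutput` are hypothetical (the real names aren't given here), and a plain object stands in for `window` so the snippet runs outside a WebView:

  ```typescript
  // Hypothetical shape of the harness-facing properties; the real names
  // used by the sandbox are not known here.
  interface SkillHost {
    skillInput?: string;  // string the harness sets before the page runs
    skillOutput?: string; // string (plain text or HTML) the page hands back
  }

  // Read the input string, run the skill, and publish the result as a string.
  function runSkill(host: SkillHost, handler: (input: string) => string): void {
    const input = host.skillInput ?? "";
    host.skillOutput = handler(input);
  }

  // Escape text so it is safe to embed in the returned HTML.
  function escapeHtml(text: string): string {
    return text
      .replace(/&/g, "&amp;")
      .replace(/</g, "&lt;")
      .replace(/>/g, "&gt;");
  }

  // Example skill: return a small rendered HTML fragment.
  const shout = (input: string): string =>
    `<p>${escapeHtml(input.toUpperCase())}</p>`;

  // In a WebView this would be `window`; a plain object stands in for it.
  const host: SkillHost = { skillInput: "hello <world>" };
  runSkill(host, shout);
  console.log(host.skillOutput); // <p>HELLO &lt;WORLD&gt;</p>
  ```

  Keeping the contract to plain strings means the harness only has to set one value and read one value back, whatever the skill does in between.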
- c2k 26 minutes ago
  I run MLX models with omlx[1] on my Mac and it works really well.
  [1] https://github.com/jundot/omlx
- jackp96 26 minutes ago
  [flagged]
  - potsandpans 20 minutes ago
    I'm tired of this concern trolling.
PullJosh 36 minutes ago
This is awesome!
1) I am able to run the model on my iPhone and get good results. Not as good as Gemini in the cloud, but good.
2) I love the “mobile actions” tool calls that allow the LLM to turn on the flashlight, open maps, etc. It would be fun if they added Siri Shortcuts support. I want the personal automation that Apple promised but never delivered.
3) I am so excited for local models to be normalized. I build little apps for teachers and there are stringent privacy laws involved that mean I strongly prefer writing code that runs fully client-side when possible. When I develop apps and websites, I want easy API access to on-device models for free. I know it sort of exists on iOS and Chrome right now, but as far as I’m aware it’s not particularly good yet.
TGower 8 minutes ago
These new models are very impressive. There should be a massive speedup coming as well: AI Edge Gallery is running on the GPU, but the NPUs in recent high-end processors should be much faster. The A18 chip, for example (MacBook Neo and iPhone 16 series), has a 35 TOPS Neural Engine vs. a 7 TFLOPS GPU. Similar story for Qualcomm.
- api 1 minute ago
  That’s actually nuts for such a low-power chip. Can’t wait to see the M series version of that.
  I’m sure very fast TPUs in desktops and phones are coming.
jeroenhd 27 minutes ago
English version of the page: https://apps.apple.com/us/app/google-ai-edge-gallery/id67496...
Also on Android: https://play.google.com/store/apps/details?id=com.google.ai....
It's a demo app for Google's Edge project: https://ai.google.dev/edge
iamdamian 22 minutes ago
This is very cool. I'm excited to see how far you can go with local models (including renting GPUs to run gemma4-26 and 31 unquantized).
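For a rough sense of what "unquantized" costs, here is a back-of-the-envelope sketch. It assumes "26" and "31" are parameter counts in billions and counts weights only (no KV cache, activations, or runtime overhead), so real memory use will be higher:

```typescript
// Weight-only memory estimate: parameters × bits per weight / 8 bytes.
// Ignores KV cache, activations, and runtime overhead.
function weightGiB(paramsBillions: number, bitsPerWeight: number): number {
  const bytes = paramsBillions * 1e9 * (bitsPerWeight / 8);
  return bytes / 2 ** 30;
}

// Assumption: "26" and "31" are parameter counts in billions.
for (const params of [26, 31]) {
  for (const bits of [16, 8, 4]) {
    const gib = weightGiB(params, bits).toFixed(1);
    console.log(`${params}B @ ${bits}-bit: ~${gib} GiB`);
  }
}
```

Under these assumptions, 16-bit weights for the larger model come to roughly 58 GiB, which is why smaller machines tend to need lower-bit quantizations.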
hadrien01 47 minutes ago
Is it me or does the App Store website look... fake? The text in the header ("Productiviteit", "Alleen voor iPhone") looks pixelated, like it was edited in Paint; the header background is flickering; the app icon and screenshots are very low quality; and the title of the website is incomplete ("App Store voor iPho...").
- giarc 42 minutes ago
  It's the Dutch version, see /nl/ in the URL.
  If you just go to https://apps.apple.com/ it does look better, but I agree, still a bit "off".
- throwatdem12311 43 minutes ago
  Issues caused by a low effort localization?
  On my iPhone it opens in the App Store app, so it looks fine to me.
- piperswe 44 minutes ago
  What browser are you using? I don't see any of this behavior on Firefox...
  - hadrien01 39 minutes ago
    Firefox on Windows, but it looks about the same in Edge.
    Screenshot of the header: https://i.imgur.com/4abfGYF.png
    - t-sauer 5 minutes ago
      Renders equally weird for me on Firefox on Windows 11. Firefox on macOS looks good though.
    - morpheuskafka 19 minutes ago
      It looks like there is some sort of glow effect on the text that isn't rendering right in your browser? It arguably doesn't have the best contrast, but it seems to be as intended in Safari 26.3. Looks similar in Chrome on macOS too: https://imgur.com/yq5PrKm
- j0hax 43 minutes ago
  Everything renders crystal clear with Firefox on GrapheneOS.
- ezfe 34 minutes ago
  Nothing weird on my side
carbocation 17 minutes ago
It would be very helpful if the chat logs could (optionally) be retained.
darshil2023 1 hour ago
[dead]