Apple Core AI Framework

(developer.apple.com)

144 points | by hmokiguess 5 hours ago

5 comments

  • an0malous 1 hour ago
    This is why the AI companies are rushing to IPO. By the end of next year you’ll be running most of your AI on device. They have no moat, they’ve reached the limits of scaling, most of the magic can be distilled into smaller models, and they know it
    • hadlock 33 minutes ago
      Qwen's ~30B-class models are genuinely good enough for use if you can find a machine with enough memory bandwidth to run them at 30-90 tokens/second. It's been extremely telling that Qwen stopped releasing 120b class models. At some point in the next 10 years (maybe 3?) someone is going to release an Opus 4.5 class 256B model you can run locally. Right now our engineers use about $800/mo worth of opus tokens; at that rate the ROI for local LLM is ~10 months
    • cat5e 33 minutes ago
      Huzzah, they’ve lost their stranglehold. Viva la revolution!
    • sealeck 48 minutes ago
      Have we reached the limits of scaling? Sadly it appears that larger model still equals better model
      • mindwok 12 minutes ago
        I think GPT 4.5 showed that there is indeed a practical limit we're close too. That was supposedly a high-trillions of parameter model that was deprecated almost immediately because it was slow, insanely expensive, and had questionable benefits over the smaller models. Though apparently the new Mythos and whatever GPT Spud is (if it wasn't 5.5) are back up in the high trillions.
      • mikestorrent 17 minutes ago
        Well, let's not forget that text models are not the only models! Video models are much slower and need comparatively more resources, and all they can do even at that size is generate videos a few seconds long. Clearly a ton more work is going to go into those, and demand for them will probably increase as more creative tools get authored using them as a central part of the workflow. Low-res local rendering for preview might be a thing, but the lion's share of the work for high-res, near-realtime rendering is going to be done on huge clusters for a long time yet.
      • pixelready 37 minutes ago
        I think there’s still an open question around are the ultra-large next-gen models worth it? For those of us without early access to Mythos, it’s hard to verify whether it’s been held back from the public due to actually being “too dangerously powerful to release yet” as implied or because the gains aren’t outpacing the costs.
      • stogot 43 minutes ago
        It’s still diminishing returns yes? It isn’t Moore’s Law
    • ActorNightly 12 minutes ago
      Very false.

      I use small models exclusively. They aren't a replacement for large models. You need decent hardware to run those models efficiently, as smaller parameter models plain suck and are still slow on macbooks. And affordability of higher end hardware is very limited.

      Even at non VC subsidized $/token prices, its still much cheaper to run cloud based models.

      • davnicwil 4 minutes ago
        well to be fair that's right now, I think the question is what about in 6 months, 12 months, 2 years?

        Where do these improvement curves go? Does the gap close, do they intersect for practical purposes (factoring in cost etc)? Or is the local curve always just a translation of the hosted, lagging behind, or indeed does hosted just pull ahead?

        Nobody knows, but it's a very open question I feel, and it certainly appears like the answer might quite reasonably be that yes they intersect on that kind of short-ish term time horizon.

  • franze 1 hour ago
    i am more excited about the ondevice foundation model update that is coming https://developer.apple.com/documentation/updates/foundation... (not much info yet)

    but i maintain https://github.com/Arthur-Ficial/apfel so i might be biased

    • trollbridge 36 minutes ago
      Thanks for building this! Something I grab on a regular basis, especially for doing simple education of folks about the basics of using LLMs by showing something that's not just a chatbot.
    • crancher 1 hour ago
      Apfel is very useful, thanks for the effort.
      • cat5e 32 minutes ago
        I second this, I’m more excited about dumb local models than something I could never run locally.
  • MysticOracle 2 hours ago
    WWDC 2026 Core AI videos

    Meet Core AI - https://developer.apple.com/videos/play/wwdc2026/324/

    Dive into Core AI model authoring and optimization - https://developer.apple.com/videos/play/wwdc2026/325/

    Integrate on-device AI models into your app using Core AI - https://developer.apple.com/videos/play/wwdc2026/326/

  • bensyverson 3 hours ago
    Wow, this seems to be a new way to convert PyTorch models to a format that runs across CPU, GPU & Apple's Neural Engine (ANE). [0]

    Does this completely replace the previous API, CoreML? [1]

      [0]: https://apple.github.io/coreai-optimization/
      [1]: https://developer.apple.com/documentation/coreml/
    • earthnail 2 hours ago
      Yes. From the CoreAI docs:

      "If your app uses model types other than neural networks, such as decision trees or tabular feature engineering, see Core ML."

    • trollbridge 56 minutes ago
      This is just a bit exciting, although I wonder how the performance of this will stack up next to the stuff we already do with, e.g., a metal-optimised model which we then load into llama-cpp or whatever. (unsloth is a good example of doing this for you "batteries included").
    • pzo 1 hour ago
      seems they planning to replace it but overall now I'm really confused about this and mlx and coremltools. They should do better work explaining the benefits (and cons) of it and any feature parity between coreai, coreml and mlx.
    • wahnfrieden 1 hour ago
      Requires OS 27+, so CoreML is still useful for backwards compatibility.
  • criddell 23 minutes ago
    Is there something like this on Linux? For example, if I’m an application developer can I assume GNU Core AI (or whatever it is or would be called) will be there if the kernel is >= some particular version?