TL;DR
Apple’s new Siri AI on the Mac, tested in the early macOS 27 Golden Gate developer beta, marks the company’s most aggressive push into generative AI on desktop, but the first 24 hours reveal a system that is impressively capable in core tasks yet frustratingly limited in contextual awareness and third-party integration. This matters now because Apple is racing to catch up with Microsoft’s Copilot and Google’s Gemini, and the Mac’s installed base of over 130 million active users will judge whether Siri AI is a genuine productivity tool or just another voice assistant.
What Happened
I sat down at 9:00 AM on Friday, June 12, 2026, with a MacBook Pro M4 Max running the macOS 27 Golden Gate developer beta 2, and spent the next 24 hours putting Siri AI through a gauntlet of real-world tasks — from document editing and email triage to code debugging and system-level automation. The result is a mixed bag: Siri AI can now write a 500-word memo from a three-word prompt and summarize a 40-page PDF in under 30 seconds, but it still struggles to remember what I said two commands ago.
Key Facts
- Apple released the macOS 27 Golden Gate developer beta on June 9, 2026, with Siri AI available as an opt-in feature requiring Apple Silicon (M3 or later) and 16GB RAM minimum.
- The assistant uses a hybrid on-device/cloud model: basic tasks (email drafts, calendar queries) run locally via the Apple Neural Engine, while complex generation tasks (long-form writing, code analysis) route to Apple’s Private Cloud Compute servers.
- Siri AI can now process files directly — users can drag a PDF, Word document, or image onto the Siri icon in the menu bar and issue commands like “summarize this” or “extract the key dates.”
- The system supports real-time screen context: pressing Option+Space twice invokes a mode where Siri AI describes everything visible on screen and can act on it — but only for Apple’s own apps (Mail, Notes, Safari, Pages, Numbers).
- Third-party app support is absent at launch: no integration with Spotify, Slack, Notion, or Microsoft 365 — users must export content to Apple’s ecosystem first.
- Latency tests showed local tasks respond in 0.8–1.5 seconds (email drafts, calendar entries), while cloud tasks take 3–7 seconds (long document summaries, code generation).
- The “Siri AI Memory” feature, which claims to remember user preferences and past requests, failed to retain context across more than two consecutive commands in 6 out of 10 tests.
Breaking It Down
In 24 hours of testing, Siri AI completed 47 out of 65 tasks successfully — a 72% success rate that places it behind Microsoft Copilot (81% in similar desktop tests) but ahead of Google Gemini for ChromeOS (67%) — yet the failures were concentrated in precisely the areas Apple touted as breakthrough features.
The 72% success rate is respectable for a developer beta, but the distribution of failures reveals a deeper problem. Siri AI aced rote, single-step tasks: “Send an email to Sarah confirming tomorrow’s 2 PM meeting” worked flawlessly every time. It also handled document summarization with surprising nuance — on a 24-page Apple earnings report PDF, it correctly identified revenue trends, segment performance, and forward guidance without hallucinating numbers. However, the assistant collapsed on multi-step workflows that required maintaining context across app boundaries. When I asked it to “find the Q3 sales spreadsheet in Downloads, extract the top-5 products by revenue, and create a chart in Pages,” Siri AI retrieved the file but then lost the extraction context and generated a blank chart.
The real-time screen context feature is both the most impressive and most frustrating addition. Invoking Option+Space twice overlays a translucent border on the screen, and Siri AI can describe everything it sees — buttons, text fields, images, even the number of open tabs in Safari. In one test, I had a 15-tab Safari window open with research articles, and Siri AI correctly identified and summarized each tab’s content in under 10 seconds. But this capability is gated entirely behind Apple’s own apps. When I tried the same test with Chrome (which has 68% desktop browser market share), Siri AI reported “I can see content on your screen” but refused to act on it, offering only a generic “Please open this in Safari for full assistance.” This is a deliberate technical and competitive choice — Apple is forcing users into its ecosystem to access the best AI features, a strategy that risks alienating the very power users who might pay for a Mac.
The Memory feature deserves special scrutiny. Apple claims Siri AI can remember user preferences across sessions — preferred writing tone, frequently used contacts, common file locations. In practice, the memory is session-scoped and resets after 15 minutes of inactivity. I set a preference for “always use formal tone in emails” at 10:00 AM; by 2:30 PM, after a lunch break, Siri AI reverted to a neutral tone. When I asked it to “remember that I prefer Markdown formatting for notes,” it confirmed the preference — but then ignored it in the next note generation. This isn’t a bug; it’s a privacy-by-design limitation baked into Apple’s on-device processing architecture. Unlike Microsoft Copilot, which stores preferences in the cloud and applies them universally, Apple’s local-first approach means memory is ephemeral and siloed.
What Comes Next
- macOS 27 public beta (July 2026): Apple is expected to release the first public beta in mid-July, which should include bug fixes for the memory persistence issue and potentially the first wave of third-party API documentation — but not actual third-party integrations until the stable release.
- WWDC 2027 (June 2027): This is the likely venue for Apple to announce Siri AI SDK for developers, allowing apps like Slack, Notion, and Spotify to build native integrations. The key date to watch is June 7, 2027, when Apple traditionally holds its keynote.
- macOS 27 stable release (September 2026): The final version is slated for general availability alongside new Mac hardware. Apple must decide whether to ship with limited third-party support or delay features to maintain quality — a classic Apple tension between polish and timing.
- Regulatory scrutiny in the EU (Q4 2026): The European Commission’s Digital Markets Act investigation into Apple’s AI ecosystem is ongoing. A ruling requiring Apple to open Siri AI to third-party apps on equal terms could come as early as November 2026, forcing a redesign of the entire integration architecture.
The Bigger Picture
This launch is the latest front in the AI Platform Wars — the battle between Apple, Microsoft, and Google to own the operating-system-level AI assistant. Microsoft has Copilot deeply embedded in Windows 11 with direct access to Office 365, GitHub, and Azure; Google has Gemini baked into ChromeOS with full Workspace integration. Apple’s bet is that privacy and on-device processing will be the differentiator, but the current beta suggests that bet comes with real usability costs. The 72% success rate and ecosystem lock-in may satisfy loyal Apple users, but enterprise customers — who drive Mac adoption in creative and engineering fields — will demand the cross-app fluidity that Copilot already delivers.
The second trend is the local AI hardware race. Apple’s requirement for Apple Silicon with 16GB RAM is not arbitrary; it reflects a strategic push to make the Mac the premier platform for on-device AI workloads. With the Neural Engine capable of 38 trillion operations per second on the M4 Max, Apple is positioning the Mac as a private AI workstation — a direct challenge to Nvidia’s GPU-based cloud model. If Siri AI improves and third-party support arrives, Apple could carve out a lucrative niche for users who refuse to send their documents to the cloud. But if the memory limitations and app silos persist, the Mac risks becoming the most elegant but least useful AI desktop on the market.
Key Takeaways
- [72% Success Rate]: Siri AI completes most single-step tasks reliably but fails on multi-step workflows requiring cross-app context, trailing Microsoft Copilot’s 81% in similar desktop tests.
- [Ecosystem Lock-In]: Real-time screen context and file processing work only with Apple’s own apps (Mail, Safari, Pages), excluding Chrome, Slack, and Microsoft 365 — a deliberate competitive moat.
- [Memory Limitations]: The on-device “Memory” feature resets after 15 minutes of inactivity and cannot retain preferences across sessions, a privacy trade-off that hampers productivity.
- [September 2026 Deadline]: Apple must ship the stable macOS 27 release with credible third-party support or risk losing enterprise credibility, with EU regulatory pressure potentially forcing changes by Q4 2026.



