Tag Archives: Voice

Hey Siri on new AirPods is Fantastic

I’m just going to say up front that I am a big fan Hey Siri generally, but it’s especially nice when accessed via AirPods and HomePods too. The first version of AirPods were excellent and I ordered them the minute they were available, aesthetics be damned. People joked about how they looked but I wore them out in my small rural town without hesitation because these little white buds were the future (and honestly, I just don’t care much about what people think of me). I wore them on errands, walks, all the time. The one thing I hoped for with the next version was Hey Siri and we got it.

It’s been a week and I just want to praise these great little things that are now even better. I mean, you know, before I had to lift my finger up and double tap. I almost sprained my finger every time! Wink. But really, as easy as that was, I have to say that this IS an improvement. Just as with the HomePod, speaking out Hey Siri and getting a nearly instant response is pretty fantastic and less error prone. In fact, while I’d occasionally have misses with my double taps (worse in winter with hoods and hats in the way), the new AirPods have picked up every single request.

A bonus is that Siri on my iPad, my main device, is now useful again. Before the HomePod always picked up “Hey Siri” which was fine for most things. And while activating the Siri button on the 2018 iPad Pro isn’t difficult I found that I used Siri less without the Home Button activation. Yet another bonus, with the AirPods connected to my iPad I can use Hey Siri even while enjoying music on the HomePods or watching a movie with loud audio through the HomePods. The mics on the AirPods are excellent and audio just above a whisper will do the trick. And in the hierarchy of Hey Siri devices they seem to rule above the HomePod which is what I want when I’m actively using them.

Two+ years in with AirPods, a year with HomePod, and Hey Siri still seems like science fiction to me. I use it constantly and still crack a smile half of the time. I’m used to it but still surprised, comfortable but still delighted. We’re not yet at Star Trek Enterprise voice interactions but I’ll happily use what we have.

My favorite daily interactions with AirPods and HomePods are the usual things I suspect most people use (assuming they’re in the habit of using it at all):

• Controlling home kit devices. This is an all the time thing for lights, plugs, checking the temp in or outside.
• Adding reminders, especially shopping list items. But I do all reminders with Siri.
• Adding calendar events. I can’t think of the last time I’ve done this with a keyboard.
• Sending texts while walking.
• Initiating knowledge/image/web searches.
• Searching my own photos.
• Getting the weather forecast.
• Controlling playback of music and podcasts.
• Maps/Local business calls/open/hours info.

Probably the only remaining roadblock is my own mind. For certain tasks I find that I still tend to work visually, for example, choosing music. I’m just not great at remembering albums names though requesting a couple of frequent playlists and artists is something I do. Also, I’d like to get better at composing vocally. I’ve been using a screen and keyboard so long that dictating anything beyond a sentence or two really requires a different kind of process though it’s fun to try. But for the list of quick short tasks above, is a real pleasure to use.

HomePod: Sometimes great, sometimes just grrrrrrrrrrr.

Tuesday Morning
I’m getting out of bed as two dogs eagerly await a trip outside which they know will be followed by breakfast. I ask Siri to play the Postal Service. She responds: “Here you go” followed by music by the Postal Service. The music is at about 50% volume. Nice. But in three full days of use I’m feeling hesitant about HomePod and the Siri within. And the next moment illustrates why. I slip on my shoes and jacket and ask Siri to Pause. The music continues. I say it louder and the iPhone across the room pipes up: “You’ll need to unlock your iPhone first.” I ignore the iPhone and look directly at HomePod (5 feet away) and say louder as I get irritated “Hey Siri, pause!” Nothing. She does not hear me (maybe she’s  enjoying the music?). By now, the magic is long gone and is replaced by frustration. I raise my voice to the next level which is basically shouting and finally HomePod responds and pauses the music. Grrr.

I go outside with my canine friends and upon return ask Siri to turn off the porch light. The iPhone across the room responds and the light goes off. I ask her to play and the HomePod responds and the Postal Service resumes. I get my coffee and iPad and sit down to finish off this review. I lay the iPhone face down so it will no longer respond to Hey Siri. Then I say, Hey Siri, set the volume to 40%. Nothing. I say it louder and my kitchen light goes off followed by Siri happily saying “Good Night Enabled”. Grrrrrrrrrrrr. I say Hey Siri loudly and wait for the music to lower then say “set the Kitchen light to 40%” and she does. The music resumes and I say Hey Siri and again I wait then I say “Play the Owls” and she does. I’d forgotten that I also wanted to lower the volume. But see how this all starts to feel like work? There’s nothing magical or enjoyable about this experience.

Here’s what I wrote Sunday morning as I worked on this review:

“When I ordered the HomePod I had no doubt I would enjoy it. Unlike so many that have bemoaned the missing features I was happy to accept it for what Apple said it was. A great sounding speaker with Apple Music and Siri. Simple.

It really is that simple. See how I did that? Apple offered the HomePod and I looked at the features and I said yes please.

I then proceeded to write a generally positive review which is below and which was based on my initial impressions based on 1.5 days of use. By Monday I’d edited to add in more details, specifically the few failures I’d had with Siri and the frustration of iPhone answering when I didn’t want it to.

I went into the HomePod expecting a very positive experience. And it’s mostly played out that way. But it’s interesting that by Tuesday morning my expectation of failure and frustration have risen. Not because HomePod is becoming worse. I’d say it’s more about the gradual accumulation of failures. They are the exception to the rule but happen often enough to create a persistent sense of doubt.

Set-up
As has been reported. It’s just like the AirPods. I was done in two minutes. I did nothing other than plug it in and put my phone next to it. I tapped three or four buttons and entered a password. Set-up could not possibly be any easier.

Siri
In a few days of use I’m happy to report that HomePod has performed very well. In almost every request I have made Siri has provided exactly what I asked. My hope and expectation would be that Siri on HomePod would hear my requests at normal room voice. While iPad and iPhone both work very well, probably at about 85% accuracy I have to be certain to speak loudly if I’m at a distance. Not a yell1, but just at or above normal conversational levels. With HomePod on a shelf in my tiny house, Siri has responded quickly and with nearly 100% accuracy and that’s with music playing at a fairly good volume. Not only do I not have to raise my voice, I’ve been careful to keep it at normal conversational tones or slightly lower. I’ll say that my level is probably slightly lower than what most people in the same room would easily understand with the music playing.

For the best experience with any iOS device I’ve learned not to wait for Siri. I just say Hey Siri and naturally continue with the rest of my request. This took a little practice because earlier on I think Siri required a slight pause or so it seemed. Not any more. But there’s no doubt, Siri is still makes mistakes even when requesting music which is supposedly her strongest skill set.

The first was not surprising. I requested music by Don Pullen, a jazz musician that a friend recommended. I’d never listened to him before and no matter how I said his name Siri just couldn’t get it. She couldn’t do it from iPhone or iPad either. Something about my pronunciation? I tried, probably 15 times with no success. I did however discover several artists with names that sound similar to Don Pullen. I finally turned on type to Siri and typed it in and sure enough, it worked. I expect there are other names, be they musicians or things outside of Music that Siri just has a hard time understanding. I’ve encountered it before but not too often. The upside, the next morning I requested Don Pullen and Siri correctly played Don Pullen. Ah, sweet relief. A sign that she is “learning”?

Another fail that seems like a learning process for Siri, the first time I requested REM Unplugged 1991/21: The Complete Sessions she failed because I didn’t have the full name. I just said REM Unplugged and she started playing a radio station for REM. When I said the album’s full name it worked. I went back a few hours later and just said REM Unplugged and it worked. Again, my hope is that she learns what it is I’m listening to so that in the future a long album name or a tricky artist name will not confuse her. Will see see how it plays out (literally!).

Yet another failure, and this one really surprised me. I’ve listened to the album “Living Room Songs” by Olafur Arnalds quite a bit. I requested Living Room Songs and she began playing the album Living Room by AJR. Never heard of it, never listened to it. So, that’s a BIG fail. There’s nothing difficult about understanding “Living Room Songs” which is an album in my “Heavy Rotation” list. That’s the worst kind of fail.

One last trouble spot worth mentioning. I have Hey Siri turned on on both my iPhone and Apple Watch. Most of the time the HomePod picks up but not always. On several occasions both the phone and watch have responded. I’ve gotten in the habit of keeping the phone face down but I shouldn’t have to remember to do that. I definitely see room for improvement on this.

I’ve requested the other usual things during the day with great success: the latest news, played the most recent episode of one of my regular podcasts, gotten the weather forecast, current temperature, sent a few texts, used various Homekit devices, checked the open hours of a local store and created a few reminders. It all worked the first time.

There were a couple of nice little surprises. In changing the volume, it’s possible to just request that it be “turned up a little bit” or “down a little bit”. I’m guessing that there is a good bit of that natural language knowledge built in and we only ever discover it by accident. Also, I discovered that when watching video on the AppleTV, if the audio is set to HomePod, Siri works for playback control so there’s no need for the Apple remote! This works very well. Not only can Siri pause playback but fast forward and rewind as well.

Audio Quality
Of course Apple has marketed HomePod first and foremost as a high quality speaker, a smart Siri speaker second. I agree with the general consensus that the audio quality is indeed superb. For music and as a sound system for my tv, I am very satisfied. My ears are not as well tuned as some so I don’t hear the details of the 3D “soundstage” that some have described. I subscribe to Apple Music so that’s all that matters to me and it works very well. Other services or third party podcast apps, can be played from a Mac or iOS device via AirPlay to HomePod. I also use Apple’s Podcast app (specifically for the Siri integration) so it’s not an issue for me.

Voice First: Tasks and Music
The idea of voice first computing has caught on among some in the tech community who are certain that it is the future. I certainly have doubts. Even assuming perfect hardware that always hears perfectly and parses natural language requests perfectly (we’re not there yet) I certainly have problems with the cognitive load of voice computing. I’ll allow that it might just be a question of retraining our minds for awhile. It’s probably also a process of figuring out which things are better suited for voice. Certain tasks are super easy and tend to work with Siri via whatever device. This is the list of usual things people are doing because they require very little thinking: setting timers, alarms, reminders, controlling devices, getting the weather.

But let’s talk about HomePod and Siri as a “musicologist” for a moment. An interesting thing about playing music, at least for me, is I often don’t know what it is I want to play. When I was a kid I had a crate of records and a box of cassette tapes. I could easily rattle off 10 to 20 of my current favorites. Overtime it changed and the list grew. But it was always a list I could easily remember. Enter iTunes and eventually Apple Music. My music library has grown by leaps and bounds. My old favorites are still there but they are now surrounded by a seemingly endless stream of possibility. In a very strange way, choosing music is now kind of difficult because it’s overwhelming. On the one hand I absolutely love discovering new music. I’m listening to music I never would have known of were it not for Apple Music. I’ve discovered I actually like certain kinds of jazz. I’m listening to an amazing variety of ambient and electronic music. Through playlists I’ve discovered all sorts of things. But if I don’t have a screen in front of me the chances of remembering much of it is nil. If I’m lucky I might remember the name of a playlist but even that is difficult as there are so many being offered up.

So while music on the HomePod sounds fantastic when it’s playing I often have these moments of “what next?” And in those moments my mind is often blank and I need a screen to see what’s possible. I’m really curious to know how other people who are using voice only music devices decide what they want to play next.

Conclusion
There isn’t one. This is the kind of device that I want to have. I’m glad I have it. I enjoy it immensely. It is a superb experience until it isn’t which is when I want to throw it out a window. Hey Apple, thanks?


  1. Well, sometimes a yell is actually required. ↩︎

Siri and voice first

In a recent episode of his Vector podcast, Rene Ritchie had “voice first” advocate Brian Roemmele. Rene is probably my current favorite Apple blogger and podcaster and Vector is excellent.

As I listened to this episode I found myself nodding along for much of it. Roemmele is very passionate about voice first computing and certainly seems to know what he’s talking about. In regards to Siri, his primary argument seems to be that Apple made a mistake in holding Siri back after purchasing the company. At the time Siri was an app and had many capabilities that it no longer has. Rather than take Siri further and developing it into a full-fledged platform Apple reigned it in and took a more conservative approach. In the past couple of years it has been adding back in, via Siri Kit, what it calls domains.

Apps adopt SiriKit by building an extension that communicates with Siri, even when your app isn’t running. The extension registers with specific domains and intents that it can handle. For example, a messaging app would likely register to support the Messages domain, and the intent to send a message. Siri handles all of the user interaction, including the voice and natural language recognition, and works with your extension to get information and handle user requests.

So, they scaled it back and are rebuilding it. I’m not a developer but my understanding of why they’ve done this is, in part, to allow for a more varied and natural use of language. But as with all things internet and human, people often don’t want to be bothered with the details. They want what they want and they want it yesterday. In contrast to Apple’s handling of Siri we have Amazon which has it’s pedal to the floor.

Roemmele goes on to discuss the rapid emergence of Amazon’s Echo ecosystem and the growth of Alexa. Within the context of this podcast and what I’ve seen of his writing, much of his interest and background seems centered on commerce and payment as they relate to voice. That said, I’m just not that interested in what he calls “voice commerce”. I order from Amazon maybe 6 times a year. Now and in the foreseeable future I get most of what I need from local merchants. That said, even when I do order online I do so visually. I would never order via voice because I have to look at details. Perhaps I would use voice to reorder certain items that need to be replaced such as toilet paper or toothpaste but that’s the extent of it.

What I’m interested in is how voice can be a part of the computing experience. There are those of us that use our computers for work. For the foreseeable future I see myself interacting with my iPad visually because I can’t update a website with my voice. I can’t design a brochure with my voice. I can’t update a spreadsheet with my voice. I can’t even write with my voice because my brain has been trained to write as I read on the screen what it is I’m writing.

But this isn’t the computing Roemmele is discussing. His focus is “voice first devices”, those that don’t even have screens, devices such as the Echo and the upcoming HomePod1. And the tasks he’s suggesting will be done by voice first computing are different. And this is where it get’s a bit murky.

Right now my use of Siri is via the iPhone, iPad, AppleWatch and AirPods. In the near future I’ll have Siri in the HomePod. How do I make the most of voice first computing? What are these tasks that Siri will be able to do for me and why is Roemmele so excited about voice first computing. The obvious stuff would be the sorts of things assistants such as Siri have been touted as being great for: asking about for the weather, adding things to reminders, setting alarms, getting the scores for our favorite sports ball teams and so on. I and many others have written about these sorts of things that Siri has been doing for several years now. But what about the less obvious capabilities?

At one point in the podcast the two discuss using voice for such things as sending text. I often use dictation when I’m walking to dictate a text into my phone when using Messages and I see the benefit of that. But dictation, whether it is dictating to Siri or directly to the Messages app or any other app, at least for me, requires an almost different kind of thinking. It may be that I am alone in this. But it is easier for me to write with my fingers on the keyboard then it is to “write” with my mouth through dictation. It might also be that this is just a matter of retraining my brain. I can see myself dictating basic notes and ideas. But I don’t see myself “writing” via dictation.

At another point Roemmele suggests that apps and devices will eventually disappear as they are replaced by voice. At this point I really have to draw a line. I think this is someone passionate about voice first going off the rails. I think he’s let his excitement cloud his thinking. Holding devices, looking, touching, swiping, typing and reading, these are not going away. He seems to want it both ways though at various points he acknowledges that voice first doesn’t replace apps so much as it is a shift in which voice becomes more important. That I can agree with. I think we’re already there.

Two last points. First, about the tech pundits. Too often people let their own agenda and preference color their predictions and analysis. The lines blur between their hopes and preferences and what is. No one knows the future but too often act as they do. It’s kinda silly.

Second, what seems to be happening with voice computing is simply that a new interface has suddenly become useful and it absolutely seems like magic. For those of us who are science fiction fans it’s a sweet taste of the future in the here and now. But, realistically, its usefulness is currently limited to very fairly trivial daily tasks mentioned above. Useful, convenient and delightful? Yes, absolutely. Two years ago I had to go through all the trouble of putting my finger on a switch, push a button or pull a little chain, now I can simply issue a verbal command. No more trudging through the effort of tapping the weather app icon on my screen, not for me. Think of all the calories I’ll save. I kid, I kid.

But really, as nice an addition as voice is, the vast majority of my time computing will continue to be with a screen. I don’t doubt that voice interactions will become more useful as the underlying foundation improves and I look forward to the improvements. As I’ve written many times, I love Siri and use it every day. I’m just suggesting that in the real world, adoption of the voice interface will be slower and less far reaching than many would like.


  1. Actually, technically, the HomePod technically has a screen but it’s not a screen in the sense that an iPhone has a screen. ↩︎

Hey Siri, give me the news

Ah, it’s just a little thing but it’s a little thing I’ve really wanted since learning of a similar feature on Alexa. In fact, I just mentioned it in yesterday’s post. We knew this was coming with HomePod and now it’s here for the iPad and iPhone too. Just ask Siri to give you the news and she’ll respond by playing a very brief NPR news podcast. It’s perfect, exactly what I was hoping for. I’ve already made it a habit in the morning, then around lunch and again in the evening.

Alexa Hype

A couple years ago a good friend got one of the first Alexa’s available. I was super excited for them but I held off because I already had Siri. I figured Apple would eventually introduce their own stationary speaker and I’d be fine til then. But as a big fan of Star Trek and Sci-fi generally, I love the idea of always present voice-based assistants that seem to live in the air around us.

I think he and his wife still use their Echo everyday in the ways I’ve seen mentioned elsewhere: playing music, getting the news, setting timers or alarms, checking the weather, controlling lights, checking the time, and shopping from Amazon. From what I gather that is a pretty typical usage for Echo and Google Home owners. That list also fits very well with how I and many people are using Siri. With the exception of getting a news briefing which is not yet a feature. As a Siri user I do all of those things except shop at Amazon.

The tech media has recently gone crazy over the pervasiveness of Alexa at the 2018 CES and the notable absence of Siri and Apple. Ah yes, Apple missed the boat. Siri is practically dead in the water or at least trying to catch-up. It’s a theme that’s been repeated for the past couple years. And really, it’s just silly.

Take this recent story from The Verge reporting on research from NPR and Edison Research

One in six US adults (or around 39 million people) now own a voice-activated smart speaker, according to research from NPR and Edison Research. The Smart Audio Report claims that uptake of these devices over the last three years is “outpacing the adoption rates of smartphones and tablets.” Users spent time using speakers to find restaurants and businesses, playing games, setting timers and alarms, controlling smart home devices, sending messages, ordering food, and listening to music and books.

Apple iOS devices with Siri are all over the planet rather than just the three or four countries the Echo is available in. Look, I think it’s great that the Echo exists for people that want to use it. But the tech press needs to pull it’s collective head out of Alexa’s ass and find the larger context and a balance in how it discusses digital assistants.

Here’s another bit from the above article and research:

The survey of just under 2,000 individuals found that the time people spend using their smart speaker replaces time spent with other devices including the radio, smart phone, TV, tablet, computer, and publications like magazines. Over half of respondents also said they use smart speakers even more after the first month of owning one. Around 66 percent of users said they use their speaker to entertain friends and family, mostly to play music but also to ask general questions and check the weather.

I can certainly see how a smart speaker is replacing radio as 39% reported in the survey. But to put the rest in context, it seems highly doubtful that people are replacing the other listed sources with a smart speaker. Imagine a scenario where people have their Echo playing music or a news briefing. Are we to believe that they are sitting on a couch staring at a wall while doing so? Doing nothing else? No. The question in the survey: “Is the time you spend using your Smart Speaker replacing any time you used to spend with…?”

So, realistically, the smart speaker replaces other audio devices such as radio but that’s it. People aren’t using it to replace anything else in that list. An Echo, by it’s very nature, can’t replace things which are primarily visual. As fantastic as Alexa is for those that have access to it, for most users it still largely comes down to that handful of uses listed above. In fact, in another recent article on smart speakers, The New York Times throws a bit of cold water on the frenzied excitement: Alexa, We’re Still Trying to Figure Out What to Do With You

The challenge isn’t finding these digitized helpers, it is finding people who use them to do much more than they could with the old clock/radio in the bedroom.

A management consulting firm recently looked at heavy users of virtual assistants, defined as people who use one more than three times a day. The firm, called Activate, found that the majority of these users turned to virtual assistants to play music, get the weather, set a timer or ask questions.

Activate also found that the majority of Alexa users had never used more than the basic apps that come with the device, although Amazon said its data suggested that four out of five registered Alexa customers have used at least one of the more than 30,000 “skills” — third-party apps that tap into Alexa’s voice controls to accomplish tasks — it makes available.

Now, back to all the CES related news of the embedding of Alexa in new devices and/or compatibility. I’ve not followed it too closely but I’m curious about how this will actually play out. First, of course, there’s the question of which of these products actually eventually make it to market. CES announcements are notorious for being just announcements for products that never ship or don’t ship for years into the future. But regardless, assuming many of them do, I’m just not sure how it all plays out.

I’m imagining a house full of devices many of which have microphones and Alexa embedded in them. How will that actually work? Is the idea to have Alexa, as an agent that listens and responds as she currently does in a speaker, but also in all of the devices be they toilets, mirrors, refrigerators… If so, that seems like overkill and unnecessary costs. Why not just the smart speaker hub that then intelligently connects to devices? Why pay extra for a fridge with a microphone if I have another listening device 10 feet away? This begins to seem a bit comical.

Don’t get me wrong, I do see the value of increasing the capabilities of our devices. I live in rural Missouri and have a well house heater 150 feet away from my tiny house. I now have it attached to a smart plug and it’s a great convenience to be able to ask Siri to turn it off and on when the weather is constantly popping above freezing only to drop below freezing 8 hours later. It’s also very nice to be able to control lights and other appliances with my voice, all through a common voice interface.

But back to CES, the tech press and the popular narrative that Alexa has it all and that Siri is missing out, I just don’t see it. A smart assistant, regardless of the device it lives in, exists to allow us to issue a command or request, and have something done for us. I don’t yet have Apple’s HomePod because it’s not available. But as it is now, I have a watch, an iPhone and two iPads which can be activated via “Hey Siri”. I do this in my home many times a day. I also do it when I’m out walking my dogs. Or when I’m driving or visiting friends or family. I can do it from a store or anywhere I have internet. If we’re going to argue about who is missing out, the Echo and Alexa are stuck at home while Siri continues to work anywhere I go.

So, to summarize, yes, stationary speakers are great in that their far-field microphones work very well to perform a currently limited series of tasks which are also possible with the near-field mics found in iPhones, iPads, AirPods and the AppleWatch. The benefit of the stationary devices are accurate responses when spoken to from anywhere in a room. A whole family can address an Echo whereas only individuals can address Siri in their personal devices and have to be near their phone to do so. Or in the case of wearables such as AirPods or AppleWatch, they have to be on person. By contrast, these stationary devices are useless when we are away from the home when we have mobile devices that still work.

My thought is simply this, contrary to the chorus of the bandwagon, all of these devices are useful in various ways and in various contexts. We don’t have to pick a winner. We don’t have to have a loser. Use the ecosystem(s) that works best for you If it’s Apple and Amazon enjoy them both and use the devices in the scenarios where they work best. If it’s Amazon and Google, do the same. Maybe it’s all three. Again, these are all tools, many of which compliment each other. Enough with the narrow, limiting thinking that we have to rush to the pronouncement of a winner.

Personally, I’m already deeply invested in the Apple ecosystem and I’m not a frequent Amazon customer so I’ve never had a Prime membership. I’m on a limited budget so I’ve been content to stick with Siri on my various mobile devices and wait for the HomePod. But if I were a Prime member I would have purchased an Echo because it would have made sense for me. When the HomePod ships I’ll be first in line. I see the value of a great sounding speaker with more accurate microphones that will give me an even better Siri experience. I won’t be able to order Amazon products with the HomePod but I will have a speaker with fantastic audio playback and Siri which is a trade off I’m willing to make.

HomePod and the Siri Ecosystem

homepod-white-shelf

I’ve recently written about my hopes for a more proactive Siri. I’ve written about Siri quite a bit over the past couple years and it’s been mostly positive. Frankly, I think “she” is pretty fantastic and I call upon her many times a day. I remember when a friend first showed me his Echo a couple years back. I instantly wanted a Siri powered speaker by Apple. I’ve been waiting ever since. I will buy the HomePod the day it becomes available. No hesitation and with the same excitement and for the same reasons as I bought the AirPods the minute they were available: music, podcasts, Siri.

But much of the tech press has another take on the voice assistant market. Over the past couple of years it’s become fashionable in the tech media, especially among the Apple nerds that love to pride themselves on their very high standards, to complain about Siri while holding up high Alexa and the Echo1.

Interestingly, Pew has recently come out with an article on the very issue of voice assistants by Americans. Not too surprisingly 46% use voice assistants and of that 42% access via smartphone. 14% access via computer or tablet. Only 8% access via a stand-alone device such as an Echo. My own experience and observation of family mirror this. Practically all of my extended family have and use Siri on a myriad of devices on a daily basis. But in that same group the Echo is only in one household.

There are a few exceptions to the common chorus of the bandwagon and two of my favorites are Daniel Erin Dilger and Neil Cybart. They offer a more mature, big picture analysis. It’s less about whether or not they are personally pleased with how a product suits them but more about the larger context and trends. They seem to do a much better job of taking into account how the potential interactions of the larger public will play out.

I agree with their recent analysis of the HomePod, digital assistants and devices. Their posts were less about Siri and more about the varied form factors of devices through which digital assistants are accessed as well as the larger function of those devices.

First, Daniel Erin Dilger over at AppleInsider recently discussed the intent of Apple’s upcoming HomePod. I agree with his take on it, nicely summarized by the article title, Apple’s HomePod isn’t about Siri, but rather the future of home audio. He does a great job of digging into the difference in the intended function of the devices. Specifically, the role of the devices in the home. This bit comparing the audio quality of HomePod to the Echo made me giggle:

It’s an emotional experience, which is exactly what Apple has been increasingly pursuing as it enhances its products. Amazon Alexa isn’t an emotional experience; it’s an intellectual one. It’s a polite conversation with a librarian who moonlights as a sales agent at an online warehouse and plays songs with the fidelity of a clock radio.”

Over at Above Avalon Neil Cybart recently wrote a post in which he explores, in part, the bias of the media in relation to the perceived success of the platforms and ecosystems. As he has in the past, he does an excellent job of providing some context about where the market is at based on numbers rather than the din emanating from the excited bandwagon.

We are in the midst of a massive mindshare bubble involving stationary smart speakers in the home. While the press talk up the category with near breathless enthusiasm and positivity, there is a growing amount of evidence that stationary smart speakers powered by digital voice assistants do not represent a paradigm shift in computing. Instead, the stationary smart speaker’s future is one of an accessory, and it will be surpassed in prominence by wearables. It’s time to call out the stationary smart speaker market for what it is: a mirage.

On more than one occasion Neil has compared the consistent praise heaped upon the Echo as compared to the criticism put upon the Apple Watch which, in terms of sales, has not only sold more units but at a greater profit. Amazon is practically giving away its Echo devices. Put another way, in terms of the number of form factors offered by each ecosystem, Apple’s is far more diverse and as a result, more useful and it’s also making a profit for the company.

Another of my favorites and the host of the Vector Podcast, Rene Ritchie, has recently covered the topic with guests in two different episodes.

The first, in which Rene interviewed Jan Dawson in episode 18, mirrored the points made Neil Cybart. Jan points out that contrary to the fixation on the Echo, it’s actually Siri that has the largest number of users. The only way that Apple appears to be behind is if we focus only on home speaker hardware such as the Echo, a market Apple has yet to enter but will enter in early 2018. But he correctly points out that Alexa-based hardware, as a share of the voice assistant market, is actually very small. He also makes the point that voice, as a computer UI, is still only one in a larger pool of UIs and that it is often not appropriate for use in many settings.

In the second, Rene interviews Ben Bajarin in episode 35. In this second interview Ben Bajarin disagrees somewhat the the above three takes. He suggests that there is a large and growing market for what he calls ambient computing. He suggests that Google is in the weakest position with search being it’s differentiator. Amazon is in a very good position with it’s differentiator being commerce. He goes on to say that Apple is very much a part of the market and that it’s differentiator is communications and service. He also suggests that while Apple is already in the game he looks for them to do more a lot more.

In the second half of the interview there is a great conversation about Siri’s future. Ben Contends that Siri is already quite good and that the general public is largely satisfied with it. They then delve into machine learning for Siri and the line that Apple is walking in regards to differential privacy as a technique for collecting large-scale social data and how this contrasts with more personalized data as the base for machine learning. It’s a great conversation.

I’m with Ben that I hope Apple can figure out how to better personalize it’s approach. In short, many of us trust Apple with our data and would like Apple to use it for a more fine-tuned machine learning for individuals. Their discussion concludes on the problem of making Siri in the HomePod work in the context of families. How can a communal device be used in both a communal context and an personal one? Or can it?

It will be interesting to see how the Apple’s Siri ecosystem evolves. While Amazon is offering various Echo devices all of which are tied to the home2, Apple offers global distribution in many languages in form factors ranging from wearables to pocketable to carried and soon, a stationary, home device. In the Apple ecosystem I am connected to Siri and thus, a variety of services, all of the time. Or, more precisely, anywhere I have an internet connection which, these days is almost everywhere I am. In other words, Siri is already providing, to a great degree, ambient voice-controlled computing.

This plethora of devices has me covered in a variety of circumstances but I’m curious to see how that works in the home with the HomePod. I’ll have an iPhone and iPad sitting within range all the time. The watch on my wrist and the HomePod somewhere in the room. What will happen when I say “Hey Siri”? I’m just one person in a tiny house. I’m also curious what happens when you have 4 or 5 family members each with devices in a larger house. Currently Siri does a good job of responding only to the device owner and even does a pretty good job of responding with the right device when several “Hey Siri” devices are within range. I assume this will be no different when the HomePod is introduced.

The new year is around the corner and with it the HomePod. I’m looking forward to trying it out. I don’t doubt that Apple is working on making all of it’s Siri devices work well together. I’ve got a spot on my shelf waiting for the new arrival.


  1. I‘ve come to dislike the Apple-oriented echo-chamber because they seem to approach every new thing from a very narrow perspective. To my ears and eyes, it comes off as snobbery. ↩︎
  2. And just a handful of countries ↩︎