"If you have more than one Echo or Echo Dot, you can set a different wake word f...

ansible · on March 3, 2016

Since the processing is cloud based, and they know my identity, why don't the devices recognize this fact and cooperate. Instead of just 7 beam forming mics in the Echo, if you have two within hearing distance you could have the benefit of 14 and a unified response.

The echo and noise reduction software that I'm aware of can't really do that in a reasonable fashion.

With current solutions, you've got one DSP that's receiving all the audio streams simultaneously, and they need to be exactly synchronized in time. Then, using basically pattern-matching, it figures out what direction the user's voice is coming from, and combines some/all of the audio streams together to eliminate environmental noise and make the speech as clear as possible.

To do this with separate devices, you'd want extremely precise time synchronization. Which is possible, but I wouldn't want to implement it.

The extra processing and synchronization would take longer, and delay input to the speech recognition engine. I don't think it would enhance the user experience.

Edit: spelling.

t0mbstone · on March 3, 2016

Just have the Echo that hears the person best be the one that responds. So simple, and easy to implement. I honestly don't understand why Amazon hasn't fixed this yet. It's so fucking obvious.

joekrill · on March 4, 2016

> So simple, and easy to implement.

Ah yes, the rally cry of the person not doing the actual development work... In my experience, rarely is _anything_ "So simple, and easy to implement".

ansible · on March 3, 2016

Agreed. Doing something sensible at a higher level than the actual audio recording would be easily possible.

visarga · on March 4, 2016

> I don't think it would enhance the user experience.

Baidu trains the voice recognizer by adding all kinds of noise to the training data. I think it might be easier to do that than use multiple microphones. The neural net learns to do the difficult process of separation of useful data from noise.

mrbill · on March 3, 2016

I learned to not have the wake word be "Amazon" when I was watching online training for AWS. The Echo went nuts until I finally paused everything and changed the wake word back to "Alexa".

t0mbstone · on March 3, 2016

They really need to make it so that all of the Amazon Echos on the same network use a proximity algorithm to determine which one responds. Simply: The Echo that hears you best should be the one to respond.

I want to have an Echo in every room, and I don't want to have to remember all their different names!

jimktrains2 · on March 3, 2016

> I have 3 separate devices nearby that are Google Now voice activated (the newer devices support this even if the screen is off), and they will sometimes trigger at the same time accidentally.

> Since the processing is cloud based, and they know my identity,

Interesting, so everything said in that room gets processed and potentially sent to Google for indefinite storage? What a 1984-style luxury.

Karunamon · on March 3, 2016

AFAIK, every one of these devices does nothing until a "wake word" is heard, and only then do they record+send.

Having all of the devices listening all the time would be a bandwidth and power nightmare, if not for the sender, for the receiver.

mavhc · on March 3, 2016

Correct, of course it can activate accidentally.

https://history.google.com/history/audio has a list of all audio recorded

jimktrains2 · on March 3, 2016

What about accidental triggers of the wake word? What about planted "wake words" to record people discussion "inappropriate" things?

cthalupa · on March 3, 2016

For the Echo, at least, it has to use your home network, so you could pretty easily run a packet capture to see if it's ever sending audio out when you don't want it to.

Harder for things with cellular data, though.

jimktrains2 · on March 3, 2016

It's a cat-and-mouse game: What if it only sent the clandestine information when it picks up the "normal" word? The point is you don't control the device or its software.

gr3yh47 · on March 3, 2016

>What a 1984-style luxury

exactly why I think none of this is worth it (echo, google now, siri, smart tvs etc) - especially given the current applications of the 3rd party doctrine - you are giving up the right to privacy for everything that is said in your home.

masonhipp · on March 3, 2016

I agree with this entirely. I've been waiting patiently for a way to add microphone distance to my Echo and this is perfect for that... except it doesn't work that way.

I am very much hoping they fix it in the future and add a software layer to combine/route commands with one single wake word.

Nullabillity · on March 3, 2016

It's also a bit annoying that the Android Wear version of Now doesn't work the same as the regular Android version. For example, the full-sized one seems much more flexible with wording, and supports listening in several languages at once, while Wear is limited to one language.

Touche · on March 3, 2016

But it's limited to 3 words which is weird. I'd rather three "Office, order socks" than try to remember that the one in my office goes by "Amazon".