Kindle collects a surprisingly large amount of data

falcolas · on Aug 25, 2020

This statement - "None of these requests appear to be used for customer features like last read location." - bugs me, because it's fairly obviously false, and detracts from the real concerns.

To sync a "last read page" across devices, you need to send a location back to Amazon. It's also appropriate to tie a location to a device, so you can pick the appropriate device to sync your position from. And, when you highlight a word, the translation, definition, and wiki page are brought up, so of course it's being sent to Bing and Wikipedia.

There are valid concerns here (there's too much information being sent overall - the location data doesn't need to be sent with every page turn, for example), but these concerns are being buried behind FUD about none of this data needing to be transmitted.

EDIT: Can I also point out the ironic nature of griping about Amazon's analytics collection while running an analytics suite on the webpage yourself?

zql=Kindle%20Collects%20a%20Surprisingly%20Large%20Amount%20of%20Data pqo=1 xfg=1 xqi=946451 h=8 m=58 s=11 eqm=https%3A%2F%2Fnullsweep.com%2Fkindle-collects-a-surprisingly-large-amount-of-data%2F uel=https%3A%2F%2Fnews.ycombinator.com%2F nvn=b271bb7f9e0fe444 xpx=1598364493 bqq=2 oso=0 ajh=1598366510 lyz=1598364493 _ref=https%3A%2F%2Fnews.ycombinator.com%2F euq=0 cookie=1 res=1080x1920 fpr=429 rlp=xnxpI1

BCharlie · on Aug 25, 2020

I mention that the data that appears to be used for those purposes is sent again in a separate request to a separate end point, so we have two types of requests: last read location, and reading analytics. Sorry it wasn't clear, I'll try to improve the wording.

falcolas · on Aug 25, 2020

Will you also be updating and noting that the requests to Wikipedia and Bing are for explicit customer-benefiting features?

Might be worth noting that you can opt out of their data collection (on the e-reader, at a minimum) as well. Settings > Device Options > Advanced Options > Privacy or in the device management console in your account on amazon.com

danShumway · on Aug 25, 2020

The text in question:

> Highlighting or tapping any word will send the requests with the text to Bing Translate and Wikipedia, as well as back to Amazon.

Is there a reason why that text needs to be sent before the user clicks the "translate" button? Is there a reason why it needs to be sent to Amazon?

fouric · on Aug 25, 2020

> Is there a reason why that text needs to be sent before the user clicks the "translate" button?

Yes - UX latency. I would expect this kind of thing to take a few thousand milliseconds, and shaving off a few hundred milliseconds from between when the user highlights text and when they select "translate" is significant. The fact that this data is being sent to Wikipedia of all places further signals that the usage is likely to be innocuous.

Do I think that this is globally a good design decision? No, for both engineering and privacy reasons. There's definitely no good reason why it should be sent to Amazon at all.

falcolas · on Aug 25, 2020

> There's definitely no good reason why it should be sent to Amazon at all.

I was racking my brain on this, and all I could come up with was "to independently verify the invoicing for Bing translations" and "how many times are people accessing the definition/translation and not highlighting". So, analytics, not something that explicitly benefits the user.

benmller313 · on Aug 25, 2020

Can we stop pretending that analytics don't explicitly benefit the user? Product Engineering organizations rely on analytics to improve user experiences.

naikrovek · on Aug 25, 2020

Analytics can be done less granularly and still benefit the user. Also, surely not every data point collected is used to benefit the user.

For example, Amazon doesn't need to know where I am when I request a definition or translation. If they're concerned about usage, they only need to know how many times I actually used one or both of those features per day, per week, or month. They don't need to know instantly every single time a word is highlighted.

edanm · on Aug 26, 2020

> Analytics can be done less granularly and still benefit the user. Also, surely not every data point collected is used to benefit the user.

How? For all we know, it isn't granular - it might be aggregated at the server level to hide specific users' actions. But they'd still need to be sending in the data from the device to the server.

wongarsu · on Aug 26, 2020

The device could keep a daily count of interesting actions, and sync that to analytics servers on a daily or weekly basis. That preserves 95% of legitimate use cases while leaking much less private data (like how my reading habits are distributed across the day)
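A minimal sketch of that idea, assuming a hypothetical on-device `DailyUsageCounter` (the class, the action names, and the payload shape are all made up for illustration, not anything Amazon ships). It stores only per-day totals, with no timestamps and no highlighted text, and hands over completed days when a sync happens:

```python
from collections import Counter
from datetime import date


class DailyUsageCounter:
    """Hypothetical aggregator: keeps per-day action counts only."""

    def __init__(self):
        self._counts = {}  # maps a date to a Counter of action names

    def record(self, action, day):
        # Only the count is incremented; time of day and content are never stored.
        self._counts.setdefault(day, Counter())[action] += 1

    def flush(self, today):
        """Return totals for days before `today` and drop them locally."""
        finished = sorted(d for d in self._counts if d < today)
        return [
            {"day": d.isoformat(), "counts": dict(self._counts.pop(d))}
            for d in finished
        ]


counter = DailyUsageCounter()
counter.record("define", date(2020, 8, 25))
counter.record("define", date(2020, 8, 25))
counter.record("translate", date(2020, 8, 25))
counter.record("define", date(2020, 8, 26))
payload = counter.flush(today=date(2020, 8, 26))
```

Here `payload` holds one entry for Aug 25 with two "define" and one "translate"; the Aug 26 count stays on the device until that day is over.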

edanm · on Aug 26, 2020

I mean, you're still collecting most of the problematic data. And you might legitimately be interested in what you're leaving out - knowing time of day that people do things is actually important for plenty of use cases.

_abox · on Aug 25, 2020

They can but they're often much more than that.

Also it should really be opt in. Or at least opt out. I hate Amazon looking over my shoulder while reading a book.

patcul100 · on Aug 25, 2020

That's why I don't use their reading app and use a custom OS.

SamReidHughes · on Aug 26, 2020

The word choice we want here is directly vs. indirectly.

throwaway_pdp09 · on Aug 25, 2020

I'm surprised you'd say that. Out of interest, how do analytics help websites avoid blathery, unhelpful text in overly small fonts, rendered so pale as to be unreadable? A lot of UI failings are of this most basic kind.

luckylion · on Aug 25, 2020

When you play an online slot game where you bet money that some numbers will appear on screen, and they use analytics to "improve user experience" (read: engagement, read: you losing more money), is that benefiting you or is it benefiting them?

bberenberg · on Aug 25, 2020

Kindle devices have a dictionary on device. By looking into which words are most frequently defined, they can add these to the local dictionary to help improve the speed of the UI.

freeone3000 · on Aug 25, 2020

The screen refresh rate on these devices is measured in seconds, so a few hundred millis of network latency is impossible to display.

fouric · on Aug 25, 2020

This isn't universally true - Dan Luu's computer latency page[1] lists three Kindles, all below 900 ms of latency. And, since some devices have latency as low as 570 ms, it makes sense that they would use this optimization.

[1] https://danluu.com/input-lag/

notatoad · on Aug 25, 2020

have you actually used a kindle? it certainly doesn't take seconds for the definitions to pop up. a full-page refresh might take a second, but most page turns or UI interactions are partial draws and are much faster.

II2II · on Aug 26, 2020

I suspect that they were overstating a limitation of these devices rather than speaking from inexperience. While it has been years since I've used a Kindle, I do use Kobo devices and the delays are perceptible. While changing a page may be quite quick, user interface elements (such as a box containing a definition) seem to take longer. I suspect that they have to be more aggressive when refreshing the screen before and after these user interface elements are displayed in order to make the ghosting less perceptible.

If you want to see what I mean by the ghosting of user interface elements being more perceptible, try using KOReader. The ghosting after using a menu can be quite noticeable (at least on Kobo devices, which are based on the same technology).

dahfizz · on Aug 26, 2020

You're exaggerating how slow the screens are.

And the fact that the screens are slow should be motivation to make the rest of the system as responsive as possible. A good software engineer will work around bottlenecks, not shrug their shoulders and introduce new ones.

lapetitejort · on Aug 25, 2020

Also remember that "Kindle" can refer to an app on your phone or desktop computer, all of which may share code related to highlighting and translating.

shajznnckfke · on Aug 25, 2020

That doesn’t seem right. Let’s consider the screen refresh to be like a subway station, where the train shows up every few seconds. We need the text we want to show to the user to be at the stop waiting when the train arrives. If we miss the train, we need to wait for the next train to get our text on the screen. The network latency delays when we show up to wait at the station.

If the refresh rate is 5 seconds, and the network response time is 500ms, then eliminating the 500ms response time means we are 10% less likely to miss the train. On average, the time for the text to appear on the screen decreases by 500ms.

All this assumes the refreshes happening on a static schedule. If the software can trigger the refresh, then it’s a lot simpler. The 500ms improvement in latency would apply equally to every engagement with the translate feature.
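The train analogy can be checked numerically. This is a rough sketch under the assumptions stated above (a fixed 5-second refresh schedule, a 500 ms response time, and requests arriving at evenly spread moments within a refresh period); the function names are made up for the example:

```python
import math


def display_delay(request_time, latency, period):
    """Seconds from the request until the first refresh at or after the data is ready."""
    ready = request_time + latency
    next_refresh = math.ceil(ready / period) * period
    return next_refresh - request_time


def average_delay(latency, period, samples=10_000):
    """Average display delay over request times spread evenly across one period."""
    times = [(i + 0.5) * period / samples for i in range(samples)]
    return sum(display_delay(t, latency, period) for t in times) / samples


def missed_fraction(latency, period, samples=10_000):
    """Share of requests that miss the refresh they would have caught with zero latency."""
    times = [(i + 0.5) * period / samples for i in range(samples)]
    missed = sum(
        display_delay(t, latency, period) > display_delay(t, 0.0, period)
        for t in times
    )
    return missed / samples


saved = average_delay(0.5, 5.0) - average_delay(0.0, 5.0)
missed = missed_fraction(0.5, 5.0)
```

Under these assumptions `saved` comes out at about 0.5 seconds and `missed` at about 0.10, which matches the 500 ms and 10% figures. If the software triggers the refresh itself, there is no schedule to miss and the delay is simply the latency plus the draw time.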

freeone3000 · on Aug 25, 2020

There's no static schedule. It's an e-ink display. Refreshes happen when software tells it to display something new and take several hundred millis per blank - and a screen can be up to three blanks (because if it doesn't go white-black-display, then some pixels get stuck "on" or "off" or "halfway").

shajznnckfke · on Aug 25, 2020

In that case, it’s clear that eliminating the network request before triggering the refresh directly reduces the amount of time the user has to wait to see the result.

majormajor · on Aug 25, 2020

There isn't a "translate button" - the selection of the word is the button for define/translate/wiki. You swipe between the three cards.

I like this, as a user. I don't want MORE buttons to tap through when I'm trying to define or translate a word. Especially since the Kindle eink screen and UI is not the most responsive.

jonahrd · on Aug 25, 2020

This is literally my #1 used feature of my Kindle. I read texts in different languages to have a quick access to single-tap translations.

If it took 2 taps, I would switch platforms.

larrik · on Aug 25, 2020

On the iOS app it all appears instantly-ish when I highlight, so I'm guessing it's just the same codebase.

halbritt · on Aug 25, 2020

> Might be worth noting that you can opt out of their data collection (on the e-reader, at a minimum) as well. Settings > Device Options > Advanced Options > Privacy or in the device management console in your account on amazon.com

Good tip, I'm going to give this a whirl. Unfortunately, all the network calls add a significant amount of latency even if one didn't care about privacy.

TedDoesntTalk · on Aug 25, 2020

Can you provide the URLs so we can use pihole to block the requests?

boogies · on Aug 25, 2020

(off-topic) What’re the advantages of pihole over /etc/hosts?

_abox · on Aug 25, 2020

That it works for all devices on your network. Even ones that don't have an /etc/hosts :)

garblegarble · on Aug 25, 2020

>(off-topic) What’re the advantages of pihole over /etc/hosts?

It's good for cases exactly like this - devices where you don't have control over /etc/hosts (or where you have lots of them and don't want to keep the hosts files in sync). I use it for my Samsung TV to keep them from phoning home (but still letting me use apps)

Edit: you can also set up a DoH endpoint and filter traffic while also allowing Encrypted SNI to work

thaumasiotes · on Aug 26, 2020

> It's good for cases exactly like this - devices where you don't have control over /etc/hosts

Is the pihole a DNS server or a firewall? Sibling comments suggest it's a DNS server, but that doesn't answer this need at all -- if you don't control /etc/hosts, you don't control the device. It can do its resolution however it wants. Most obviously, it can include the domain names you don't want it to reach in its own /etc/hosts file, which you just said you didn't control.

stock_toaster · on Aug 26, 2020

In addition to sibling replies which point out network-wide usefulness... pihole (or any dns server) can/will return NXDOMAIN, unlike /etc/hosts, which can only return an IP. A dns server can also be configured to match a domain and any subdomain (wildcard match) without having to specify each entry individually.
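To illustrate the wildcard point, here is a rough sketch of the matching logic a blocking resolver could use. It is not Pi-hole's actual implementation; the function names, the `upstream` lookup table, and the `tracker.example` domain are all made up for the example:

```python
def is_blocked(hostname, blocked_domains):
    """True if hostname is a blocked domain or any subdomain of one."""
    labels = hostname.lower().rstrip(".").split(".")
    # For "a.b.tracker.example" check "a.b.tracker.example", "b.tracker.example",
    # "tracker.example" and "example" against the blocklist.
    return any(".".join(labels[i:]) in blocked_domains for i in range(len(labels)))


def resolve(hostname, blocked_domains, upstream):
    """Answer NXDOMAIN for blocked names, otherwise consult the upstream table."""
    if is_blocked(hostname, blocked_domains):
        return "NXDOMAIN"
    return upstream.get(hostname, "NXDOMAIN")


blocked = {"tracker.example"}
upstream = {"books.example": "192.0.2.10", "metrics.tracker.example": "192.0.2.99"}

answer_blocked = resolve("metrics.tracker.example", blocked, upstream)
answer_allowed = resolve("books.example", blocked, upstream)
```

One blocklist entry covers every subdomain, and a blocked name gets a "no such domain" answer. A hosts file would need a separate line for each exact hostname, and each line has to map to some IP address.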

_underfl0w_ · on Aug 25, 2020

They both work similarly if you're using them to block outbound requests, but a Pi-Hole would intercept and block outbound requests for every device on the network where it's installed, whereas editing /etc/hosts would only block requests on a single device (unless that device is your router, I guess?)

Hitton · on Aug 25, 2020

I liked the article. If you are going to update it, please consider also mentioning the technical aspects. Frankly, Amazon snooping on users is to be expected, but a short mention of which platform's app you analysed, and with which tools, would be a welcome addition.

ballenf · on Aug 25, 2020

> Frankly, Amazon snooping on users is to be expected

Snooping on users during e-commerce transactions, sure.

But recording user's detailed interactions with every ebook? I hope that's a big surprise to your average Kindle user.

It would be great to see a data request response and how much of this data is retained and for how long. It's clearly not anonymized at the request level.

Very easy to see a future where just reading certain books or reading certain books too many times could flag you as dangerous or be used to support a mental incompetence hearing resulting in loss of rights.

thaumasiotes · on Aug 26, 2020

> But recording user's detailed interactions with every ebook? I hope that's a big surprise to your average Kindle user.

I doubt it. Here are some features the Kindle phone app intentionally advertises to the user:

- prediction of how long the book will take to complete, based on your reading rate

- tracking of whether or not you read anything on any given day

crooked-v · on Aug 25, 2020

I believe page location analytics are used for the amount of money that goes to Kindle Unlimited authors, also.

It can't just track the very last page in the book that you read, because authors were gaming that by encouraging people to immediately skip to the last page of very large works they didn't otherwise care about. Instead there's some kind of heuristic that tries to figure out if you've more-or-less-normally read the book.

falcolas · on Aug 25, 2020

A good point, since KU authors are paid per page read. Lots of fraud potential there.

rtkwe · on Aug 25, 2020

I think the reason to send a sync every page turn is you don’t know if the device will be in contact when any alternate sync trigger happens so to keep it mostly up to date the best option is to constantly sync whenever you have connectivity.
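As a sketch of that trade-off, assuming a hypothetical `PositionSync` helper and a `send` callable that returns True on success (neither is a real Kindle API): each page turn overwrites the pending position, and the newest one is sent as soon as the device is online.

```python
class PositionSync:
    """Hypothetical client-side sync: only the newest reading position is kept."""

    def __init__(self, send):
        self._send = send      # callable(book_id, location) -> bool, assumed
        self._pending = None   # latest unsent (book_id, location), if any

    def on_page_turn(self, book_id, location, online):
        # A newer position replaces any older unsent one (single book for simplicity).
        self._pending = (book_id, location)
        if online:
            self.flush()

    def flush(self):
        # Clear the pending position only if the send succeeded.
        if self._pending is not None and self._send(*self._pending):
            self._pending = None


sent = []


def fake_send(book_id, location):
    sent.append((book_id, location))
    return True


sync = PositionSync(fake_send)
sync.on_page_turn("book-1", 10, online=False)
sync.on_page_turn("book-1", 11, online=False)
sync.on_page_turn("book-1", 12, online=True)
```

After these three page turns only the final position, `("book-1", 12)`, has been sent. Syncing on every page turn while online means the server is never more than one page behind if connectivity drops.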

raxxorrax · on Aug 25, 2020

I honestly don't mind the FUD as long as users don't have options. Amazon deserves the bad press in that case. Kindle is an awesome screen reader, but such features make it a bad device. A good device would just have an option "sync usage data to Amazon account" <yes/no>. People suggest it is a technical impossibility.

It is just a shame that you have no options. Had to quickly search if my kindle has GPS capabilities. Gladly it does not.

"Kindle Collects a Surprisingly Large Amount of Data" is a completely honest and in my opinion correct statement. So yes, companies are dishonest in their data collection practices and responding with exaggeration is maybe wrong. But I do care more about the data collection issue.

theptip · on Aug 25, 2020

> A good device just had an option "sync usage data to Amazon account"

The Kindle has an option to "sync last page", which you can turn off -- that sounds like it could be exactly what you're asking for, but more experimentation would be needed to know for sure.

I didn't see any mention of this config in the OP, aside from mentioning that the feature exists, so it's unclear whether the data being sent is used just for that feature, or whether less data is sent if the sync feature is turned off.

falcolas · on Aug 25, 2020

I pointed this out in a thread, but with the e-reader devices at least, you do have an option. It's opt-out, which sucks, but it does exist.

danShumway · on Aug 25, 2020

> which sucks

Note that it doesn't just suck because you're giving up using the Kindle itself. It also sucks because you'll be losing your entire collection of Ebooks, which are DRM-encumbered and can not be ported to other non-Amazon devices/platforms/apps.

This makes it extremely difficult for other privacy-respecting platforms to compete on the market, since using them requires the user to either break the law by stripping DRM from their books, or to abandon their entire purchased library.

Future TOS/EULA/Privacy changes that might not have been in place when a user originally bought their Kindle can thus be forced on them by making it prohibitively expensive for the user to opt out or change ecosystems.

falcolas · on Aug 25, 2020

I think there's a bit of a misunderstanding - you can turn off analytics on your e-reader without giving up the kindle platform. It's also separate from whispersync (which can also be disabled independently).

danShumway · on Aug 25, 2020

Just for clarification -- is this something that actually turns off the collection itself?

I'm seeing conflicting things online that range from "just hit this toggle and you're good", to "you can disable some of it, but not all", to "this only opts out of data processing for ads/analytics".

If there really is an option to disable the collection entirely, then that would mitigate a large number of the problems I have with that practice. Of course I'd love for it to be opt-in, but just giving the option would still be better than many other devices like Smart TVs.

JoshuaDavid · on Aug 25, 2020

Kindles have airplane mode and allow you to load books onto them using the USB connection. The battery also lasts somewhat longer if you use them that way. Amazon directly offers a "Download & Transfer via USB" option for ebooks you purchase in their store, as well -- this is a relatively well-supported use case.

It does mean that if you want to be absolutely sure your Kindle isn't phoning home, you can't use the Kindle browser, and you need a laptop or similar to download the things you want to transfer over. It's not a perfect solution for everyone, but for the typical HN reader who is concerned about telemetry, it should work.

walton_simons · on Aug 25, 2020

I've done this. Mine has been in aeroplane mode since the day I got it. I seem to remember having to allow it to connect to Amazon once when I first took it out of the box, but since then, no network connectivity at all, and zero problems as a result. It's been great.

I download the ebooks themselves using the Kindle application on my computer (if I'm using Amazon to get them, which I don't always), and then use Calibre to manage/import/convert/strip DRM from them. I don't need the sync functionality, or to be able to look things up on the internet (not being able to do that is a feature as far as I'm concerned!). I just want text on a page. I like the "e-reader" experience, and I have no desire to read books on a phone or tablet. I have one Kindle, and it comes with me if I think I'm going to have the opportunity to read when I'm out of the house.

Of course, if you're using Amazon to get your books they'll still build a profile of your reading habits, but there's something about tracking the exact parts of a book I'm reading, the bits I might linger on or reread, which feels extra intrusive to me, and which I categorically don't want.

thaumasiotes · on Aug 26, 2020

> Mine has been in aeroplane mode since the day I got it. I seem to remember having to allow it to connect to Amazon once when I first took it out of the box, but since then, no network connectivity at all, and zero problems as a result. It's been great.

I also never connect my Kindle to the internet. (The phone app does connect.) You don't have to allow it to connect to Amazon once. Mine has never connected.

samatman · on Aug 25, 2020

In isolation, "last read page" could surely be E2E encrypted. Amazon would know that I'm using a Kindle app or device, but everything else could be opaque.

There's no motive on Amazon's part to do it this way, it would be a hassle to implement, possibly not great for battery life, and I expect that users don't care much.

Frankly, I don't care much, in practice. In principle, yes; everything which can be kept private, should be. But Amazon knowing what page I'm on just doesn't discomfit me, the way the prospect of some company being able to read my messages does.

dahfizz · on Aug 26, 2020

The pro-privacy crowd needs to choose its battles.

The most common response about online privacy is "what does it matter if X knows Y? I've got nothing to hide".

People already don't care, and I guarantee they also don't care that Amazon knows what page they are on in the book they are reading. There are much bigger issues to focus on.

neiman · on Aug 25, 2020

Can't you do most of those things by sending encrypted data to Amazon, and getting back the encrypted data from them? They act as a storage in most cases, not as a server, no?

falcolas · on Aug 25, 2020

You'd have to figure out some kind of secure key sharing mechanism between phones, tablets, web browsers, and e-readers.

Or, you can trust that a position in a book (bookmarks, notes, etc.) is not sensitive information that really needs to be encrypted. This is my - perhaps overly pragmatic - position.

criddell · on Aug 25, 2020

I think the books you read and your annotations should definitely be protected. Imagine reading about Tiananmen Square in China.

falcolas · on Aug 25, 2020

Simply purchasing/owning a book on that topic would be enough for an oppressive government like China, they wouldn't need to know where in the book you were exactly.

criddell · on Aug 25, 2020

Some books sold in China are edited for that market. If you highlight a passage that shouldn't be in your book, you could be in trouble.

gambler · on Aug 25, 2020

>You'd have to figure out some kind of secure key sharing mechanism between phones, tablets, web browsers, and e-readers.

Yeah, it's not like Amazon can afford security experts to work on this or anything.

>Or, you can trust that a position in a book (bookmarks, notes, etc.) is not sensitive information that really needs to be encrypted.

This is an ignorant position that has been proven wrong over, and over, and over again. Private data should be secure by default, because otherwise eventually someone will figure out how to abuse it. This is a lesson from a bazillion fraud schemes and social engineering hacks everyone in tech should have learned by now.

dahfizz · on Aug 26, 2020

Amazon could also afford to fill the Panama canal with dirt and reunite the American continents, but why would they? A dozen angry (potential) customers on HN is hardly motivation.

jacobr1 · on Aug 25, 2020

I've been thinking about PII in this context.

If all data is secured by default, then the identification of PII is not about deciding to secure that data, it is about identifying where we might impose (and often this isn't required, but now we can consider it) additional UX burden or complexity in order to add _additional_ security.

neiman · on Aug 25, 2020

If I can't think of a way to abuse me by having my data, it doesn't mean that someone else doesn't. I would really rather avoid all this discussion by them not having my data to begin with.

falcolas · on Aug 25, 2020

If you know of an alternative that offers client-side encrypted sync, I'd love to hear it. I'm considering alternatives to the Kindle as well, even if for reasons unrelated to the analytics.

neiman · on Aug 25, 2020

I wish I knew :-)

paxys · on Aug 25, 2020

The data is encrypted.

neiman · on Aug 25, 2020

Encrypted between me and Amazon (such that Amazon could see the content), or encrypted between my devices such that Amazon can't see the content (but only the encrypted form)?

notatoad · on Aug 25, 2020

>the location data doesn't need to be sent with every page turn, for example

why not? if i open a book on my phone that i stopped reading on my kindle, i want it to open to the last location i read to on my kindle. not ten pages back because it doesn't sync data every page turn for some imaginary privacy benefit.

grawprog · on Aug 26, 2020

>To sync a "last read page" across devices, you need to send a location back to Amazon. It's also appropriate to tie a location to a device, so you can pick the appropriate device to sync your position from.

Why is location needed for that? Shouldn't a device id and account work just fine? I don't need to share my location to sync other devices.

gambler · on Aug 25, 2020

Why is syncing across devices not opt-in? Why doesn't Kindle tell you which data it sends and when?

falcolas · on Aug 25, 2020

Sync is opt-in.

And, good question. It would be nice, though I'm sure they've buried it in their multi-page privacy doc somewhere.

EDIT: No, it's not opt-in. Reading failure on my part.

gambler · on Aug 25, 2020

>Sync is opt-in.

https://www.epubor.com/whispersync-for-kindle.html

"And "Whispersync for Books" is enabled on Kindle Fire, Kindle devices and apps by default."

https://smallbusiness.chron.com/amazon-whispernet-work-58992...

"Whispersync is on by default in all new Kindles, but you can turn off the option on individual devices if you have multiple readers attached to your account."

https://ebookfriendly.com/how-to-disable-data-collection-kin...

"How to disable data collection on your Kindle or Fire device"

falcolas · on Aug 25, 2020

Crud, I read that wrong; you're correct in that it's opt-out.

That said, given that it provides a high value to the end user (I use it daily), I personally don't mind.

technofiend · on Aug 25, 2020

The aggravating bit - beyond the fact that Amazon doesn't let you opt out, is that this sometimes affects performance. Switching over to the kindle app occasionally hangs. Killing the app and restarting it usually works, but there are times when I have to go to airplane mode and kill and restart the app just to open a book!

falcolas · on Aug 25, 2020

You can opt out, at least on the physical devices.

But yeah, the Kindle iOS app is crap in many ways - the one that bugs me is how hot it makes my phone. I mean, WTF?

Despoisj · on Aug 25, 2020

As a former Kindle developer, I can say that most of what's mentioned in this article are metrics used to understand how the features are used (bookmarks, highlights, dictionary, etc.), how much they are used, and in which country. This allows the teams to focus on features that are actively used, and sometimes leads to discontinuing features that see little to no use. Hope that helps.

breakfastduck · on Aug 25, 2020

As many people here have echoed - this boils down to the fact the data is being captured without an opt out.

I don't doubt the developers are using it for 'morally acceptable' purposes, but I don't trust Amazon not to abuse that data later down the line!

I really don't feel that anyone needs to know precisely what pages I have viewed in a specific book.

falcolas · on Aug 25, 2020

The kindle e-readers do offer an opt-out from the metrics collection. It can be triggered from the website or the device itself.

That it's an opt-out and not opt-in is not a good thing, but it can be opted out of on the e-readers.

breakfastduck · on Aug 25, 2020

OK well that's something. An opt-in would be preferred but that's much better than nothing.

Is it confirmed though that these network requests definitely stop after that is switched?

gavreh · on Aug 25, 2020

What are the steps to do this?

ptman · on Aug 25, 2020

https://www.amazon.com/hz/mycd/digital-console/deviceprivacy... ?

zxcb1 · on Aug 25, 2020

Does not work, can you point to a tutorial? And does this include the Kindle app?

xena · on Aug 25, 2020

On my kindle Oasis:

- Go to the homescreen

- Open the hamburger menu

- Tap settings

- Device Options

- Advanced Options

- Privacy

- Disable

andrewmutz · on Aug 25, 2020

That data allows users to pick up where they left off as they change devices.

I rely on that regularly as I use both my phone and a Kindle device to read books.

breakfastduck · on Aug 25, 2020

So you should turn those features on. It doesn't mean I should have to tolerate it by default.

PeterStuer · on Aug 25, 2020

At least for EU citizens the GDPR requires this to be an opt-in, with the option to decline without service degradation.

mc32 · on Aug 25, 2020

Agree. Opt out at the minimum. How did software and features ever get done before telemetry?

Efficiency is not always the best humanistic approach. So maybe they support unused features and maybe they let some features wither that lots of people like. Maybe it would make things cost a little more. I think people would be ok with some of those inefficiencies.

ksk · on Aug 25, 2020

>How did software and features ever get done before telemetry?

IMHO, The software today is miles better at UX.

breakfastduck · on Aug 25, 2020

Is that because of telemetry or just the field developing naturally, though?

ksk · on Aug 25, 2020

No, I don't think it's just due to telemetry, I think it's a combination of multiple factors as you suggested.

paulcole · on Aug 25, 2020

The opt out is don't buy a Kindle.

hohohn · on Aug 25, 2020

That's how every company rationalizes the mass collection of user data. "Oh let's collect many terabytes of every user-action in case we need to one day discontinue a feature".

It's a book. You don't need to collect and track every fucking action I do to find out if your stupid highlighter is being used in Poland.

jjcon · on Aug 25, 2020

Whether you like it or not this collection does lead to better products - that is why you think every company does it because those that don’t usually die out. Understanding your users is vitally important.

Privacy LARPers are a tiny segment of the market, the average person doesn’t really care if their ‘usage of the highlighter function is tracked’

neiman · on Aug 25, 2020

> Privacy LARPers are a tiny segment of the market, the average person doesn’t really care if their ‘usage of the highlighter function is tracked’

If so, why don't they loudly advertise the data collection and do it only with opt-in?

It's not that the average user doesn't care if they're tracked, it's that they're not aware that they're being tracked.

jjcon · on Aug 25, 2020

You think companies should loudly advertise something people don’t care about? That doesn’t make sense.

Plenty of companies are quite transparent about their data collection practices (set up an Apple device recently?)

Most people are aware of data collection, they care more about functionality though.

shadowprofile77 · on Aug 25, 2020

>Plenty of companies are quite transparent about their data collection practices (set up an Apple device recently?)

I have not, not recently, but what you say is simply bullshit. They're "transparent" in that they give you a ToS loaded with legalese that they know you couldn't easily read through to find just how much and where they're squeezing your life for information to store. In cases where they simplify this with some less legalistic declarations of data use, what you often see there are numerous weasel words and phrases to very ambiguously describe what's being done. You know, things like "We MAY collect some information for the sake of improving user experience" and blah blah....

Then of course, there's the outright lying, which also happens, in which big tech companies simply fail to mention some types of data collection anywhere (the Amazon Alexa voice recordings being listened to by humans is a good example of this)

jjcon · on Aug 25, 2020

This isn’t buried in a tos or legalese

https://www.groundctl.com/wp-content/uploads/2018/04/csm_IMG...

Apple prompts you for each piece of data collection during the setup of an iOS device (and lets you choose if you want to share).

t-writescode · on Aug 25, 2020

You're presenting the shining example in the corporate world of responsibility with customer data, Apple, with every other company and saying that everyone does it this way?

Most companies hide it in legalese. Some companies claim they're not sending any data and then send it anyway. Looking at you Philips Hue lights.

neiman · on Aug 25, 2020

> You think companies should loudly advertise something people don’t care about?

It's not what I said.

jjcon · on Aug 25, 2020

> why don't they loudly advertise the data collection

neiman · on Aug 25, 2020

This I wrote. I didn't write "companies should loudly advertise something people don’t care about" -> you added something to my sentence, taking it out of context.

I wrote my opinion already, but I'll repeat it anyway in case it was not clear. I think you can't know if people care about it or not, as long as they're not informed about it.

rumanator · on Aug 25, 2020

> If so, why don't they loudly advertise the data collection and do it only with opt-in?

But they do.

https://m.youtube.com/watch?v=yg70ojfWXnk

neiman · on Aug 25, 2020

The video is about synch, while the conversation is about "collection does lead to better products" -> i.e, analytics.

rumanator · on Aug 25, 2020

What do you believe syncing means? This discussion talks about whispersync reporting last page read and most recent page read events. What do you think that's supposed to do?

neiman · on Aug 25, 2020

Syncing and analytics are not identical, sorry.

rumanator · on Aug 26, 2020

You're the only one fabricating accusations about "analysing" in a discussion about how Kindles send data with whispersync, a system widely known to be used to sync data across devices.

More importantly, the only usecase mentioned in the discussion that resembles anything like analysis is synching page reads across devices, and tracking reading progress to compensate authors who make their books available through subscription services.

Either you know stuff about "analysing" that for some reason you're keeping a secret, or you're talking nonsense about stuff you have no grasp over.

neiman · on Aug 27, 2020

Please read the message beginning this thread.

https://news.ycombinator.com/item?id=24271258

It's written there:

> "most of what's mentioned in this article are metrics used to understand how the features are used (bookmarks, highlights, dictionnary, etc.), how much they are used, and in which country."

Besides, I don't appreciate phrases like "fabricating accusations" or "you're talking nonsense about stuff you have no grasp over". I may be wrong, it happens often, but even if I am, this aggressive tone is out of place. You can point out my mistakes politely if they exist, the same way I do with yours.

danShumway · on Aug 25, 2020

> Privacy LARPers

This is an unnecessarily denigrating term at this point in the conversation. It's not LARPing to want to be able to read a book or take notes without being tracked.

jjcon · on Aug 25, 2020

> It's not LARPing to want to be able to read a book or take notes without being tracked.

Absolutely agree but it is LARPing to pretend this collection is for anything but improving a product. Nobody is out to get you and nobody particularly cares how often you specifically turn the page (the data is useful in aggregate).

danShumway · on Aug 25, 2020

Kindle's privacy FAQ[0] says:

> We also use it to develop and improve products and features for all our customers and to gain insights into how our products are being used, assess customer engagement, identify potential quality issues, analyze our business, and customize marketing offers.

Targeted marketing is, in itself, something that's reasonable for someone to want to block regardless of whether or not there's a mustached villain tracking you. Privacy is about more than stalkers, it's about the effects of data usage. For some people, targeted advertising is a harm regardless of whether or not the company knows their name.

To go a step farther, I also don't understand why it's LARPing to be worried about a company who is actively being investigated for misusing seller data.

I bring this up every time that one of these threads/stories gets posted, but there's (apologies, but for lack of a better word) some kind of weird gaslighting that always happens in these situations. Before it broke that Echo and Siri queries were sometimes listened to by 3rd-party contractors, if I had posted that suspicion on HN people would have called me paranoid. Once the story broke, the argument then shifted to, "well of course they're doing that, how else would you improve the service?" That kind of thinking applies to Amazon as well.

I don't know that it's likely, but I don't think it's outside the realm of possibility that Amazon might use this information in the future to help target pirates, change book rankings on their store, perform highly targeted advertising and book recommendations, or turn it over during government subpoenas. Those are completely reasonable usages that their privacy policy leaves them permission to do.

Similarly, I don't know that it's likely, but it's not outside the realm of possibility that this information might get sent to 3rd parties with less responsible data practices, or that employees might be given direct access to it in an unobfuscated form[1]. It's not something I'm losing sleep over, but I wouldn't be shocked to my core if someday all this information got leaked publicly and correlated to people's email addresses.

These are all situations where privacy matters regardless of the original intention. The "I only want to make my service better" defense applies to basically all data collection that most companies do. Even advertisers use that defense. It's reasonable for people to want to avoid being a part of that.

Of course, it's also reasonable for people not to care, to say that hacking is a risk they're willing to live with, and that they don't mind targeted ads, and that the books they read aren't sensitive. But it's not LARPing if someone has a different opinion on whether or not they want to tolerate that stuff.

[0]: https://www.amazon.com/gp/help/customer/display.html?nodeId=...

[1]: See, https://www.telegraph.co.uk/technology/2017/12/12/creepy-net.... Is it LARPing for me to be weirded out by a marketing department trolling over my reading/listening/watching habits looking for viral tweet material?

choward · on Aug 25, 2020

> Whether you like it or not this collection does lead to better products

Maybe it's just me but every tech product I use these days gets worse over time. If something does get better, two things get worse. They mostly try to optimize for user engagement and not user experience.

> Understanding your users is vitally important.

And the only way to understand people is spying on them?

jjj123 · on Aug 25, 2020

There’s an important distinction to make: this tracking doesn’t necessarily lead to better products, it leads to better business metrics.

Sometimes a better product comes out of better business metrics, but other times they’re directly opposed.

Google234 · on Aug 26, 2020

This is not true. What if for example you want to make a change to the dictionary feature because you imagine that it’s not useful and should be less prominently accessible. How would you measure if this is a good idea or not without tracking its use? This has nothing to do with business and everything to do with making the product better.

jjj123 · on Aug 26, 2020

Sure, there’s an example where best case the user experience is improved and business metrics aren’t affected. But I assure you if that app has a decent analytics setup they’ll also be tracking business metrics, and if for some reason business metrics went down with that change past some acceptable threshold, that change won’t be launched.

Now if you look at opposite case, where a feature is worse for user experience but helps business metrics, that feature will definitely be launched. A small, mostly harmless example: Ever tried to hide twitter’s recommended accounts? It gives you the option to “see less often”, but curiously there’s no option to stop seeing the window forever. Why? Because clearly it benefits twitter’s business on average to keep showing these recommendations.

I’ve built enough dark patterns at my last job to know it always comes down to business metrics.

choward · on Aug 25, 2020

Exactly. At the end of the day it's about profit and not necessarily a better product. Sometimes more profit means making a better product for the end user.

PeterStuer · on Aug 25, 2020

'Privacy LARPers are a tiny segment of the market, the average person doesn’t really care if their ‘usage of the highlighter function is tracked’'

Which is exactly why we have regulation that forbids these practices, to protect the gullible from themselves. Furthermore, do you think privacy should be the privilege of just those that are smart and keen enough to be aware and prepared to engage in a relentless and perpetual battle with the most dark of patterns with every click they make?

jononor · on Aug 25, 2020

Do you have something to back up the claim that this kind of data collection leads to better products?

ihm · on Aug 25, 2020

This comment is such cowed boot-licking of a giant corporation. Completely antithetical to the hacker ethos.

jjcon · on Aug 25, 2020

One can partake in the hacker ethos while not being a conspiracy nut - albeit I admit those sometimes go together.

lwouis · on Aug 25, 2020

Most of the world-famous libre software is built without their developers study of massively collected usage data ("telemetry").

I look at VLC as a great example to follow. Their stats show 3.4 billion downloads (https://www.videolan.org/vlc/stats/downloads.html), yet they do no telemetry at all. The product works great. It could be improved of course, but Outlook could also greatly be improved, and they have high-salary staff and a boatload of data they extract from users. Yet it's slow as hell and has lots of UX I disagree with.

I'm myself the author of a replacement of Windows "alt-tab" on macOS (https://alt-tab-macos.netlify.app/) which doesn't do any telemetry. I can lead the roadmap, with the help of the community, without spying on how users set their preferences and use the app.

As a matter of fact, it can be argued that relying on telemetry can have negative value, as it reinforces popular usage; or, from the power-user's perspective, dumbs down the software. By definition, advanced features will have low usage. It doesn't mean they should be removed.

Lastly, think about non-software businesses. Many amazing products have simply no way to gather data when the products are in the users' homes. They rely on gathering data by talking to customers at the points of purchase, customer care, and in various forums with enthusiast users. This model has shown great results, so it is in no way clearly to be avoided in favor of telemetry-everything.

rumanator · on Aug 25, 2020

> Most of the world-famous libre software is built without their developers study of massively collected usage data ("telemetry").

The sort of telemetry mentioned in the article is used for UX purposes, and God knows FLOSS sucks at UX.

And by the way, Debian collects and reports telemetry since the early 2000s, and Firefox is quite open on how much telemetry it collects.

2rsf · on Aug 26, 2020

TBH the argument that it reinforces popular usage is a valid one. At MS we were taught again and again how to design good experiments using telemetry, but in the end it's hard to support changes when your data shows that something is working properly, and UI changes tend to produce a dip in usage or satisfaction graphs until they catch up.

Google234 · on Aug 26, 2020

VLC’s UI is horrible.

jacquesm · on Aug 25, 2020

It doesn't really matter does it? You don't collect data without consent, period.

Why is that so hard to understand?

Why don't developers ever push back against this sort of thing? Collectively we build this stuff, we are not 'soldiers following orders' which makes us responsible for what we create.

The current actual use is not relevant. Consent and the possible uses are relevant.

thdrdt · on Aug 25, 2020

I think your comment is unfair.

Every webserver logs the IP address and the URL visited. Do you think most people know this? Do developers push against this?

kofejnik · on Aug 25, 2020

strawman; you visit someone else’s server, and therefore they get data about your visit; with kindle, you’re using your own device and there’s no expectation that amazon will be snooping

thdrdt · on Aug 25, 2020

"you visit someone else’s server, and therefore they get data about your visit"

I don't think the average person knows this. A lot of people even have no clue about internet. So there is no consent most of the time. And we, the developers, just let the logs running.

"with kindle, you’re using your own device and there’s no expectation that amazon will be snooping"

Well I would absolutely have this expectation. I expect a device that is connected to the internet to snoop on me. Then there is the Amazon brand. I absolutely don't trust them so I expect them to snoop on me.

But to be clear: I absolutely hate that my privacy is gone. I use all kinds of blockers to disable tracking and I also agree with jacquesm snooping is wrong. But I still think his point is too black and white and therefore unfair.

ancarda · on Aug 26, 2020

>Every webserver logs the IP address and the URL visited.

I maintain a webserver - https://git.sr.ht/~ancarda/tls-redirector - that has no support for logging. If you wanted logs for some reason, you'd need to modify the source code to add that functionality.

Granted, tls-redirector isn't a general purpose webserver, but even in production I tend to turn off logging. I just don't see the need to have logs lying around that I never use.

jacquesm · on Aug 25, 2020

No, not every webserver does. This is something that you could easily configure.

Yes, most people know this by now.

Yes, some developers push against this.

Also: It's the law. Collecting data without consent is not always legal. Whether that particular bit of data rises to the level of requiring consent is left as an exercise for the reader for their particular jurisdiction and industry.

gizmo · on Aug 25, 2020

GDPR actually forces all websites to carefully keep track of what gets logged and for how long these logfiles are retained. So yes, legislators are pushing back against the common practice of logging everything just cause.

hans_castorp · on Aug 25, 2020

> You don't collect data without consent, period.

This.

AdmiralAsshat · on Aug 25, 2020

I think the privacy-concerned end-user thinks, "Yes, I completely understand why this information is being tracked and how it would be useful to Amazon. But I still don't like it."

neiman · on Aug 25, 2020

As a freedom-concerned citizen, I always completely understood the policies and methodology of dictators and tyrants, and how what they do is useful for them.

jjcon · on Aug 25, 2020

Quit LARPing - Amazon isn’t trying to take over the world by tracking how often you use the bookmark feature.

Legogris · on Aug 25, 2020

Or "It's all fine and dandy today, but what about in x years when there's a new person/group with different incentives in charge?"

ethbro · on Aug 25, 2020

I'm surprised no one brought up revenue sharing.

I was under the impression there was a revenue-allocation problem that Amazon needed to solve (Kindle Unlimited subscriptions?), that depended on reliable reading statistics. E.g. How many people read book A?

Wish I could find the article, but the implication was there were a ton of publishers attempting to game the system. For example, by publishing blank, very long "books" and having them "read" by software automation.

neiman · on Aug 25, 2020

How does it make a difference?

First, if an entity want my input and are going to use it, they should be decent enough to pay me for giving it. Why do users need to work for free for Amazon?

Second, is it opt-in? If not, then there's an ethical issue here, even if a manual opt-out option is given (does it?). If there's no opt-out, there's a double ethical issue.

Thirdly, is this data deleted once it's been used for the goals you mentioned, or is it kept, making it a risk both for leaking and for Amazon deciding to put it to a different use in the future?

dimitrios1 · on Aug 25, 2020

You don't. You have 100% freedom to not work for Amazon. Don't buy a kindle. Don't use a kindle.

neiman · on Aug 25, 2020

If I would have known that by buying Kindle I end up working for Amazon, I indeed wouldn't have bought one.

It's deception. Please put on the box a big warning, "THIS DEVICE COLLECTS YOUR DATA", similar to those on cigarette boxes.

shmel · on Aug 25, 2020

Are you genuinely surprised at this point? Pretty much all big tech companies were caught outright lying about user data collection. Why would you assume by default they don't try to get as much as possible? They are all based on ML, of course they do.

A year or two ago Amazon was swearing that humans don't listen to Alexa conversations until we learned they actually do. IIRC Amazon tried to backpedal: "of course they do, it is their job, we meant humans don't listen _for fun_".

At this point just assume the internet connectivity as such a warning.

charles_f · on Aug 25, 2020

> Pretty much all big tech companies were caught outright lying about user data collection.

You can strip the big here.

neiman · on Aug 25, 2020

Of course I'm not surprised, but I refuse to accept this as normal.

TedDoesntTalk · on Aug 25, 2020

But your refusal doesn’t change the reality.

Kinda like refusing to believe that climate change is real does not change the reality.

neiman · on Aug 25, 2020

What? I didn't say I don't recognize the reality. I said I don't accept it as normal, meaning I work trying to change it.

radicaldreamer · on Aug 25, 2020

There’s a plastic bag over the product saying don’t open it if you don’t agree with the terms of service and that it’s required to use the device.

Also, plenty of people just leave the kindle in airplane mode and use third party software like Calibre to manage their libraries.

falcolas · on Aug 25, 2020

FWIW, the website providing this breakdown also collects analytics data without a warning. So, there's that to consider as well.

jjcon · on Aug 25, 2020

It’s called the terms of service?

jabirali · on Aug 25, 2020

Terms of service are written to be understandable by lawyers, not average end-users. At this point, understanding every terms of service, privacy policy, etc. presented by every piece of software, website, etc. encountered by an average user would require them to spend hours per week on it. This is assuming that they even have the language skills necessary to decipher the document (think of non-native English speakers, people without higher education, and so on.)

Creative Commons was on the right track with their human-readable licenses, see e.g. this example [1]. Apple is on the right track with their App Store "nutrition labels" [2]. This is what we need for people to make informed decisions. For physical objects like a Kindle, I believe such "nutrition labels" should ideally be put on the box (physical store) and website (online stores), so the consumer is aware before they go home and turn on the device (this makes it easier to compare the Kindle to a Boox or Nook at the store).

[1]: https://creativecommons.org/licenses/by-nc/4.0/

[2]: https://mashable.com/article/apple-privacy-nutrition-labels-...

ethbro · on Aug 25, 2020

ToS are effectively useless for this purpose.

If the industry moved to a standardized disclosure form (e.g. something like the HUD-1 [1] in real estate sales), people would stop complaining about this.

[1] https://www.hud.gov/sites/documents/1.PDF

oblio · on Aug 25, 2020

1. Nobody actually reads Terms of Service (well, governments and some major businesses do, but 99,99% of regular users don't).

2. Nobody reads them because most of the time they are explicitly user hostile, I'm pretty sure they are designed to prevent users from reading them.

violetgarden · on Aug 25, 2020

Yes! Even when I try to read the terms of service, I find them hard to understand. I feel bad because it’s sort of shame on me for agreeing to stuff blindly. User hostile is a good way of putting it.

danShumway · on Aug 25, 2020

Are they printed on the box in a readable form before the customer buys the product?

neiman · on Aug 25, 2020

Very different things.

2rsf · on Aug 26, 2020

Payment is a fair point on Kindles. I get why websites offer free services in return for ads (and your data), but I paid for my Kindle and (most of) the content I read.

belorn · on Aug 25, 2020

I don't think that will ease anyone with privacy concerns. People who are against government surveillance are not against the police catching criminals and solving cold murder cases. The Golden State Killer case was a very good use of DNA profiling and DNA databases to catch a criminal. The problem is that many don't trust the government to only use it for those cases, and many others don't trust the technology to have a low enough false positive rate to not cause harm to innocent people.

Understanding how the book reader features are used in practice is good. Selling the same data to an advertiser is bad. Profiling people into predefined groups is bad, and the technology has a risk of false positives/negatives that reinforce stereotypes. The law has yet to catch up and treat information gathered by libraries and information gathered by a developer of e-readers as being very similar in risks.

mumblemumble · on Aug 25, 2020

We can step outside of government examples, too, and find cases where corporations getting all data sciencey with this information have accomplished some pretty ucky - and also impossible to anticipate - things.

An instructive case here is Target figuring out that they could use customer purchase history to detect, with a pretty decent degree of confidence, when a customer was pregnant. They then proceeded to use this model to send out mailings, and those mailings resulted in people being outed in rather compromising and potentially seriously harmful ways.

Cyphase · on Aug 25, 2020

Here's the Target story from 2012: https://www.nytimes.com/2012/02/19/magazine/shopping-habits....

tgv · on Aug 25, 2020

IP address, country, Goodreads account details, each page turn, exact page location, etc., seem unnecessary for that.

iso1631 · on Aug 25, 2020

Page location and page turn are in there for syncing across devices, and that's fine - ask the user 'sync across devices?'; if they say yes, not a problem. If they say no, don't send the data. The data that is stored would be something like 'currentlocation[$bookid] = $location'. Storing historical information (user was at location 1219 at 2020-01-06-05:12:41) is not required for that function.

The philosophy should always be: store the minimum amount of data needed to provide the function that the user wants.

The IP address is transitory and shouldn't be kept longer than needed for the TCP session. Maybe it sticks in firewall logs, but that shouldn't be used for anything other than security.

Goodreads account details would only apply if you connect to Goodreads. I'm not sure what the benefit of that is, but I could see that 'user abc123 read this book' is useful data - again, ask if you can send the data.
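The "store only the current location, and only if the user agreed" idea above can be sketched in a few lines of Python. This is an illustrative sketch, not how Whispersync actually works; `SyncStore`, `sync_enabled`, and `report_position` are made-up names.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class SyncStore:
    """Hypothetical store: latest position per book, kept only with consent."""

    sync_enabled: bool = False  # opt-in: nothing is stored until the user says yes
    current_location: Dict[str, int] = field(default_factory=dict)

    def report_position(self, book_id: str, location: int) -> bool:
        """Record the newest position; return True if anything was stored."""
        if not self.sync_enabled:
            return False  # user declined sync, so the data is dropped
        # Overwrite instead of append: no timestamps, no reading history.
        self.current_location[book_id] = location
        return True

    def last_position(self, book_id: str) -> Optional[int]:
        """Return the stored position for a book, or None if there is none."""
        return self.current_location.get(book_id)
```

Because each report overwrites the previous one, the store never holds more than one entry per book, which is all that cross-device sync needs.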

Kaze404 · on Aug 25, 2020

Fair enough. How do I turn it off?

_lqaf · on Aug 25, 2020

The primary way that helps is to communicate that everyone on the team appeared to think this is perfectly acceptable to do without communicating it to the paying customer.

I mean, we already knew this, but it means any and all Amazon hardware must be considered potentially hostile.

iso1631 · on Aug 25, 2020

Almost all hardware and all software (especially software as a service) should be considered potentially hostile.

pilsetnieks · on Aug 25, 2020

It's not about how it is used, it's about how it can be used (especially when a less benevolent entity gains access to it.)

sumtechguy · on Aug 25, 2020

They have collected large amounts of data from pretty much day one on those devices.

Back when they had a cell phone in them. I was standing behind a guy who was supporting it. "Uh lets bring up where you are at? It says you are 10 miles off the coast of miami?...." "oh yeah I am calling from my yacht" "do you see any cell towers?" "no" "It kinda needs those to work. I am surprised I got the location data."

api · on Aug 25, 2020

Privacy concerns are usually about how information could be misused, not how it's used right now or routinely.

zxcb1 · on Aug 25, 2020

A Kindle comes with Kindlings, a lesser form of the book, where you are being read by Amazon while reading; you are working for Amazon in ways you might never understand.

The Kindling never leaves Amazon properties; it is not yours even though you paid almost the full price of a book.

If there is rule of law in the US and EU, these will eventually become free e-books, that is, separated from Amazon; they will regain the status and properties of the book.

taneq · on Aug 26, 2020

This is why you keep your e-books stashed on media you control, and put copies onto your Kindle when you want to read them.

Same with any data you store on an iOS device. You never let a device you don't control have the only copy of any data important to you.

zxcb1 · on Aug 25, 2020

For example, readers might want to integrate their libraries into the knowledge base of their personal AI.

raxxorrax · on Aug 25, 2020

I don't care how they are used honestly, I care about options to disable it.

moksha256 · on Aug 25, 2020

Yeah I came here to say the same. I'm about as tin-foil-paranoid-privacy-all-the-things as they come, but the "invasive" data mentioned in the post don't seem particularly invasive to me, and collecting that data seems perfectly appropriate for the purposes you mentioned.

With all that said, I do dream of a PINE64 E Ink device (or something that's open and hackable).

enchiridion · on Aug 25, 2020

Remarkable is open and hackable.

https://github.com/reHackable

jjcon · on Aug 25, 2020

It also costs more than an iPad and has terrible response times

enchiridion · on Aug 25, 2020

Yep, pretty consistent for e-ink.

Still, I think it has the best value proposition for an e-ink tablet at the moment, but I'd love to be proven wrong.

jjcon · on Aug 25, 2020

Probably true - I’ll snatch it up the moment color e-ink is a thing, color is vital for most of the papers I work with and for books I prefer a smaller form factor so from my perspective it sits in kinda an odd part of the market.

enchiridion · on Aug 27, 2020

Color e-ink is close, which is really impressive imo. I did not expect to see it for years.

Who knows how long it will take to get good enough yields for affordable consumer products.

https://www.eink.com/color-technology.html

gnusty_gnurc · on Aug 25, 2020

Yea analytics like this are really what I find to be so important, as a developer.

How much time and frustration do I potentially waste on something that no one ends up using?

Things like this are very useful and it's strange to me that people aren't sympathetic to that perspective.

taneq · on Aug 26, 2020

I think a lot of people are sympathetic to that perspective while still wanting control over their privacy.

It's the difference between someone inviting you to come into their home for a visit, and you breaking in whenever you feel like to take notes on what they're doing.

pseudalopex · on Aug 25, 2020

It's strange to you people care more about their autonomy than your convenience?

Telemetry can tell you what users are doing. It doesn't tell you why.

gnusty_gnurc · on Aug 25, 2020

I'm saying as someone who works in software I empathize with the idea of spending lots of time implementing a feature, tearing hair out over some technical issue, etc. only to realize no one uses that feature.

I'd rather people be able to opt-in, but conceptually I'm not really upset that people can see my usage patterns, etc.

pseudalopex · on Aug 26, 2020

I think most of us work in software. Asking for consent isn't hard.

Telemetry won't tell you nobody wants a feature you haven't implemented yet. User research might.

Shared404 · on Aug 25, 2020

> the "invasive" data mentioned in the post doesn't seem particularly invasive to me[.]

Attempting to get the subnet IP address? That seems pretty invasive.

From the article:

> Attempt to get the IP address on the local network (a 10. address, which was incorrect for me)

eclipxe · on Aug 25, 2020

What, exactly, will that do for them?

Shared404 · on Aug 25, 2020

That's my point. The data is both A) Invasive and B) Pointless, unless trying to do things they shouldn't on your network. But they still collect it for some reason.

neiman · on Aug 25, 2020

> don't seem particularly invasive to me, and collecting that data seems perfectly appropriate for the purposes you mentioned.

Fine. So you allow them to collect it. However, don't decide for others if it's "invasive" or "perfectly appropriate" for them or not. Do it opt-in such that people who want to share their data could do that.

Oh yeah, and offer them payment for that. They deserve it.

gvjddbnvdrbv · on Aug 25, 2020

There are some features in software I rarely use. But those times I do use them they are utterly essential. If I find such feature has been removed I am incensed.

Usefulness is NOT the same as usage.

jjcon · on Aug 25, 2020

> Usefulness is NOT the same as usage.

Metrics can tell that story though so you’re arguing a straw man.

Example: If you see that 99% of users have never used a function ever - you have a pretty good idea that it needs to be reworked or removed. You may also see a function that is used by 80% of users once a month, that you may opt to keep.
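The distinction in that example, between how many users touch a feature and how often they do, can be sketched roughly as follows. This is a hypothetical illustration in Python: the event format and the `summarize_usage` helper are assumptions, not anything Amazon is known to run.

```python
from collections import defaultdict


def summarize_usage(events, total_users):
    """Summarize one reporting window of (user_id, feature) events.

    Returns {feature: (reach, uses_per_active_user)}, where reach is the
    fraction of all users who used the feature at least once.
    """
    if total_users <= 0:
        raise ValueError("total_users must be positive")
    users = defaultdict(set)   # feature -> distinct users who used it
    counts = defaultdict(int)  # feature -> total number of uses
    for user_id, feature in events:
        users[feature].add(user_id)
        counts[feature] += 1
    return {
        feature: (len(users[feature]) / total_users,
                  counts[feature] / len(users[feature]))
        for feature in users
    }
```

A feature with high reach but about one use per active user is the "used by 80% of users once a month" case; a feature with near-zero reach is the "99% never used it" case. Neither number says why people use a feature or how much they would miss it.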

vharuck · on Aug 25, 2020

It's not so much that ubiquitous telemetry can't identify this, it's whether it's better for this than a focus group. You can have background telemetry with the focus group so you're not just giving customers what they say they want instead of what they need.

jonathanstrange · on Aug 25, 2020

I'm not sure. While I understand that developer time needs to be cut down or restrained sometimes - though perhaps not at Amazon in this case, which concerns their core business -, your example could merely turn out to be a way of losing 1% of the users. Usage statistics alone cannot tell you whether your users hate or like a feature. Some features are always going to be used more than others.

Google234 · on Aug 26, 2020

What if that feature costs 30% of dev time? Without being able to measure you wouldn’t be able to make a good judgement. Imagine how science would work without experiments?

gvjddbnvdrbv · on Aug 26, 2020

Wouldn't focus groups work better AND respect your users?

Devs think it is either telemetry or develop blind but in reality software was developed (and possibly was better) before telemetry using focus groups.

djsumdog · on Aug 25, 2020

Don't care. Still hate it. Why not add in an opt-out of metrics in the preferences?

whoopdedo · on Aug 25, 2020

As a developer, that is how _your dev team_ used the data. Can you confidently say that the metrics weren't also being accessed by the marketing department for different purposes? Or that it wasn't being shared with Amazon's business partners?

Mediterraneo10 · on Aug 25, 2020

I have quite often seen people here and on other tech forums assume that purchasing a Kindle means being locked into Amazon's ecosystem, giving up personal details, and having the risk that your books might be deleted. But you don't have to use the Kindle's internet connectivity: I have owned three generations of Kindle, and with each one I activated airplane mode the second I unboxed the device and I never turned airplane mode off. All my ebooks come from sources other than Amazon (mainly LibGen, for example), and they can be easily transferred over to the Kindle by USB because the Kindle appears as any ordinary USB drive to a computer.

belorn · on Aug 25, 2020

If this practice ever gets widespread I would guess that the developers will limit airplane mode in some way in order to ensure that the device will call home at some point.

But it is a pretty clever hack to get a hostile machine to not connect to the internet as airplane mode is (I assume) regulated behavior.

filesystem · on Aug 25, 2020

Even if the developers take the egregious step of nerfing airplane mode, you can still "opt out" by not giving the device credentials for your WiFi network.

gvjddbnvdrbv · on Aug 25, 2020

Only a matter of time before devices come with 5G data connections...

falcolas · on Aug 25, 2020

AKA, the original whispersync.

Yup, this was once a thing - you didn't need wifi for sync or downloading books at all.

iso1631 · on Aug 25, 2020

Kindles at one point apparently came with free cellular access

https://xkcd.com/548/

int_19h · on Aug 25, 2020

They still do - it's an option on more expensive devices (Paperwhite and Oasis).

sct202 · on Aug 25, 2020

I had a kindle keyboard and it had 3g. It worked in a bunch of countries--slowly though. I remember reading blogs where people were taking the sim cards out and tethering using them.