Google to stop selling ads based on your specific web browsing

samizdis · on March 3, 2021

There's an archived, no-paywall, version here:

TheRealDunkirk · on March 3, 2021

I remember a small story, decades ago, about a company/univeristy which was left holding the bag when the government cancelled their contract. They were constructing a lens system to go into a satellite which could purportedly read the issue date on a dime lying on the sidewalk. It was something like $15M. The implication was that the government had found something even better, and gone with that instead. If Google is publicly giving up on a particular, unpopular technology to buy some PR, I strongly suspect that they've already figured out something which will work "better" behind the scenes.

There's a lot of confusion over what this story is actually saying, and, given the obscurity with which Google deals with it's tech, I don't think this is going to get cleared up, leaving room for a lot of speculation. But one thing is absolutely certain: There's no possible way that Google is going to make a change to their biggest money maker that would make them LESS money. Maybe that just means that they consider their duopoly enough of a barrier that advertisers will be forced to accept less-targeted marketing, but I doubt it...

btown · on March 3, 2021

Yep - without having read the article due to paywall, they're likely referring to FLoC (Federated Learning of Cohorts) https://github.com/google/ads-privacy/blob/master/proposals/...

https://blog.google/products/ads-commerce/a-more-privacy-fir...

Make no mistake - as far as I can tell, this preserves Google's ability to track that you are, say, a new parent because you've searched for baby clothes. What it won't know is which new parent you are. The system is designed to give probabilistic assurances of k-anonymity. But Google will no doubt tune those "cohort" memberships, and the value of "k," to capture the vast majority of current advertiser needs, while still being able to communicate to antitrust inquiries and the public that they are not giving people unique identifiers. If anything, it hurts their competition more than it would hurt them, because it allows them to thread the needle in a privacy-conscious world.

sjs382 · on March 3, 2021

> without having read the article due to paywall,

https://archive.vn/qlvcE

1vuio0pswjnm7 · on March 4, 2021

Solution #1: Remove Javascript and CSS.

     curl -s https://www.wsj.com/amp/articles/google-to-stop-selling-ads-based-on-your-specific-web-browsing-11614780021 \
     |sed -n '/<meta property=.og:title. content=./{s//<title>/;s/\" ./<\/title/;p;}' \
     |egrep -o "(<p>.*</p>)|(<title>.*</title>)"  > 1.htm
     firefox ./1.htm

Solution #2: Use browser that does not process it.

     links  https://www.wsj.com/amp/articles/google-to-stop-selling-ads-based-on-your-specific-web-browsing-11614780021

Unfortunately some countries block archive.is websites (e.g., at DNS-level), so archive links will not work for everybody.

attilaberczik · on March 4, 2021

Solution #3: Use Bypass Paywalls

Free extension that can remove paywalls from many of the big news sites. Works on Chrome, Edge and Mozilla.

https://github.com/iamadamdev/bypass-paywalls-chrome

nullserver · on March 4, 2021

Still used to the 90s when any Level of bypassing would get you labeled as a hacker, and people would demand you get fired or arrested.

Just knowing how things worked resulted in a number of people assuming I could hack banks, and either asking me too, or shunning me. Or flat out accusing me of it.

Umm you could go around mugging people much easier then I could hack stuff. That explanation never went over well.

Had to ask one boss to stop threatening employees to have me hack them. It was in jest, but I was terrified at how people would react if they thought I bypass security. Didn’t help that I knew how, as security was crap back then.

I leaned to keep my mouth shut, and refused to keep up with security stuff. Can’t hack if I don’t know how.

1vuio0pswjnm7 · on March 4, 2021

To each her own, but I see the complex browser such as Chrome, Edge and Mozilla as part of the problem. Javascript too. Neither is inherently problematic, but both are too often used to create problems for users (to serve advertisers' interests) rather than to solve them. For example, without Chrome/Edge/Mozilla plus Javascript, "paywall" does not work. Yet I can still read the WSJ article just fine with either Chrome/Edge/Mozilla nor Javascript.

Much like a tech company that wants www readers to believe it is working to solve a problem www users are having that the tech company itself created (which, it just so happens, is not a problem at all for advertisers).

jonahbenton · on March 4, 2021

Useful, bookmarked your comment, thank you.

darren_ · on March 3, 2021

> Make no mistake - as far as I can tell, this preserves Google's ability to track that you are, say, a new parent because you've searched for baby clothes. What it won't know is which new parent you are.

If you've searched for baby clothes in Google there's absolutely nothing in the FLoC proposal that stops Google from inferring that you're a parent from that search and, idk, storing a bit in your profile or something.

This proposal is about allowing ad targeting to interests or whatever without tracking your visits across the web - your browser generates the cohorts clientside (idk how the cohort assignment algorithms get standardised on, I guess that's where the 'tuning' would go). And yes, you could just tell your browser not to do that, or have it only use the most recent n days' data, or randomize your cohort, or freeze your cohort in time, or blocklist certain sites from ever entering your cohort, and so on.

Where this ties into tracking if you're a parent is that while it doesn't prevent tracking that, it does lessen the incentive - if you're a parent, you're likely going to wind up in a parent-y cohort. Is the signal from the original search still going to be useful enough to warrant keeping around?

(obligatory: googler, but nothing to do with ads)

tomjen3 · on March 3, 2021

If the browser is supposed to send the information, what are they going to do with those of US who do not use Chrome? Are there plans to force firefox to follow the plan, and how will they prevent others from, say, making an extension to mess with the cohort numbers?

darren_ · on March 3, 2021

I think the expectation is other browsers will implement by default - the intention incidentally is that FLoC isn’t a google specific tech, it enables private targeting by any ad vendor. The proposal seems to mostly to be by googlers but the association is with WICG which seems to have people from all over.

As I noted in my original message - yes, its client side, you can do whatever. (‘Whether the browser sends a real FLoC or a random one is user controllable’ https://github.com/WICG/floc)

curryst · on March 3, 2021

The user controllability is genius, that neatly ties up their problem. They have a legitimate option to get out of being targeted, which they know almost no one will ever use. They could probably even add a setup prompt on install that says something like "Let us know which of these categories apply to you, so we can tailor your browsing to your interests" and get people to fill out half of it for them so they don't have to guess.

ogre_codes · on March 3, 2021

> "Let us know which of these categories apply to you, so we can tailor your browsing to your interests"

This prompt will never happen. The last thing Google wants is any additional association between their brand and advertising.

Google's mode of operation has always been to sneak in the back door and count how many boxes of cereal you have while you are looking at underwear. Knocking on the front door and to ask if they can come in would likely never occur to them.

There is no mention of Chrome being an ad supported product anywhere when you install it. They want that association completely out of people's heads.

GrifMD · on March 4, 2021

This is a misunderstanding of what these cohorts are. If you read the [FLEDGE readme](https://github.com/WICG/turtledove/blob/master/FLEDGE.md) you can see that there are multiple types of owners of cohorts that fall into 3 categories:

1. Advertisers - who would add you to a cohort they own, so say "Nike - womens-running-shoes"

2. Publishers - who'd want these cohorts to better allow for advertisers to advertiser on their site, so "NYTimes - business-section-reader"

3. Third party ad-tech companies looking to create audiences for advertisers who work with them, so "Example Agency - mens-formal-wear". They'd partner with publishers so that when you go to somewhere like GQ or Man of Many and read an article on tuxedos then you'd get added to the cohort.

So you are right, you'd never see a prompt to add in your interests, but this isn't because Google doesn't want to associate with specific brands, it's that Google isn't necessarily an owner of a cohort (though they totally could create their own under any 3 of those categories).

estaseuropano · on March 6, 2021

I would bet many will be highly local though, by city/neighbourhood/block level

prox · on March 3, 2021

This would hopefully kill the endless cookie questioning

toxik · on March 3, 2021

Cookie popups are now part of our superstitions, our Internet rites. A decent web page, it doesn't just track you and molest your integrity -- it asks first!

I don't think we'll get away from the cookie popups, ever. It's just like the "I agree to the TOS / EULA" checkboxes. Legally toothless, but you'd better have one, just in case.

btown · on March 3, 2021

And cookie popups are now part of any corporate general counsel's requirements if you're even running first-party cookies and/or any kind of analytics, regardless of third-party-cookie changes. They're not going anywhere.

Angostura · on March 4, 2021

I disagree. There is are many knowledgeable people out there that spend their time looking at the requirements of GDPR etc. and how those relate to cookies. If you can be properly advised that the kind of cookies you use aren't contravening the various privacy directives, it won't be long until they disapear.

seanhunter · on March 4, 2021

I know one place where they were forced to add a cookie popup as part of a GDPR compliance tickbox exercise. The kicker? They didn't have any cookies. After a fruitless attempt to persuade the person working through the aforementioned exercise that you only need a cookie popup if you have cookies, the popup was implemented.

Of course you need to store the user's preference. So now the site needed to have a cookie. One cookie which just stores the fact that the user has or hasn't consented to cookies.

gsich · on March 3, 2021

This could easily be solved with DNT.

Lio · on March 4, 2021

It could have but because DNT wasn’t made part of the GDPR and sites refused to do the right thing on their own it went no where.

Even cookie pop ups are not respected. If you visit Facebook sites by accident they’ll leave their tracking cookies before you’ve had a chance to read the terms and leave the site.

gsich · on March 4, 2021

I know. But it doesn't matter as DNT is universal. If you enable DNT, websites SHOULD (or better: MUST, but unrealistisc I know) honor that. For technical/functional cookies you don't need permission. The rest is off, DNT answers the question of "Do you accept" with a browser-based "No.".

graphide21 · on March 4, 2021

Chrome isn’t critically important to Google’s advertising and data initiatives. It’s important for a lot of other things, but that’s beside the point.

Google doesn’t really need much more that what it has. It has third-party big data, Gmail, Google Search and a very recent optimized cache of everything important that it’s crawlers can get and more. People get so caught up in the minutia; Google is big, like category 100 Hurricane big, where people laugh and say there is no category 100; but that’s what it is- classification system aside.

Anti-trust and pro-privacy are distractions that Alphabet, Google, etc. can put their left-fielder on while they run the bases, because they play both teams.

kevin_thibedeau · on March 4, 2021

They need a crippled browser that can't be used to bypass their revenue stream. Controlling the most popular browser gives them an important moat.

contravariant · on March 3, 2021

Presumably people would just block the ads instead?

Either way I reckon it's the same thing stopping people from just sending random data over google-analytics. So, not a whole lot, other than whatever anti-spam mechanisms google bothered to put in place.

sandworm101 · on March 3, 2021

>> just sending random data over google-analytics.

Back when I was in the security field, I heard discussions about this sort of thing as a form of attack. The question was whether a computer that sent false or incorrect information was either engaging in a DOS attack or was violating the CFAA in that it was using false information to access a service. The DOS attack would be on the tracking system rather than a website resource, false information to degrade the effectiveness of the tracking system overall. The CFAA angle would be that participation in ads/tacking was part of the contract for accessing the 'free' service, that by blocking ads or sending false info the user is using a false credential to access the service. At the time, many thought that by using an ad-blocker that users were failing to pay, passively stealing website content. Those who sent false information were engaged in active theft.

contravariant · on March 3, 2021

I'll grant that sending fake data is more active than merely not loading ads, but I personally consider not loading ads about as malicious as failing to read a billboard, or failing to subscribe when a webpage says 'Subscribe now'.

I don't think I'll ever quite understand the idea that the user is under any obligation to do whatever. Just because their user-agent might have some default behaviour doesn't mean that the user-agent should be expected to act in the interest of the server.

Though there's an argument to be made that sending lots of false data in response to a request to post personal information to google-analytics is a bit like manipulating a public survey by sending in silly answers. I'm not sure to what extent that is illegal but I'll grant that it's not exactly ethically correct.

sandworm101 · on March 4, 2021

>> not loading ads about as malicious as failing to read a billboard

Because, at this time, ad blocking tools are relatively benign. But what if we think of them more as content-blocking or content-management tools? What if I have a tool that blocks only right-wing advertisements or content? What if I have a tool that blocks all images of women showing too much skin? Block images of black people? Such not-neutral tools would radically change public perceptions. Legislation might follow.

There was a comical toll out there that would replace the text "Trump" with "The idiot" on viewed websites. That's the thin edge of this wedge.

contravariant · on March 4, 2021

Just because something is evidence of objectionable political view doesn't mean it should be forbidden.

michaelmrose · on March 3, 2021

This is approximately the time when you go from passively not agreeing to be tracked or have your attention stolen or your computer compromised by ad networks to collaborating with your fellow citizens to destroy the business of the ad networks by making it difficult to impossible to operate. In truth if every free ad supported service ceased to exist the internet would go on none the less.

btown · on March 4, 2021

I'm deeply interested in these issues as well - the criminal aspect of the CFAA being disconnected from actual harm caused to people is insane to me.

There does seem to be momentum towards a narrow reading of the CFAA when it comes to providing false information against terms of service, though, as per these 2013 analyses... though they fall short of actually giving any guarantees.

https://www.whiteandwilliams.com/media/alert/247_Coverage%20...

http://tsi.brooklaw.edu/cases/matot-v-ch

On the other hand, a denial of a motion to dismiss in Ticketmaster v. Prestige from 2018 seems to have drawn a distinction between breaching terms of service alone, and being told in a cease-and-desist not to breach those terms; the latter more clearly outlines what would be considered exceeding authorized levels of access.

https://casetext.com/case/ticketmaster-llc-v-prestige-entmt-...

Where does false information end and privilege escalation begin? Can that question depend on whether your privilege escalation is something as simple as "I'm now able to avoid the fine-print contract that I need to watch certain types and customizations of ads in order to access the service?" What constitutes sufficient notice to a user that certain actions are explicitly forbidden, if a C&D does but terms of service do not? All questions that, as far as I can tell, haven't been fully answered.

(Obligatory: Not a lawyer, the above is not legal advice.)

TeMPOraL · on March 4, 2021

> The CFAA angle would be that participation in ads/tacking was part of the contract for accessing the 'free' service, that by blocking ads or sending false info the user is using a false credential to access the service.

I'm horrified whenever I see people having this perception of the issue, and worried that it'll get accepted by the legal system as the correct view.

From my POV, the only contract that exists between me and a random ad-loaded website is the one negotiated between my computer and their server - that is, the HTTP protocol, which clearly stipulates that I can render any reply you send me in whatever way I like. This contract provides many tools for expressing the intent of gating content behind some requirements (like payment, or receiving different content first), but the common rule is: you don't send the content you're gating before the client proves they've met the requirements.

And honestly, I'd consider mixing ads into content to be borderline CFAA (the ads themselves being abusive and often fraudlent, and sometimes malware) - and sending software intending to compromise my user agent (like trackers, or ad-blocking busters) to be crossing the CFAA line. Unfortunately, I don't think the lawyers would agree with me :(.

sandworm101 · on March 4, 2021

Are you horrified when a movie theatre doesn't want people sneaking in their own food and drink? Some consider that trespass in that you have broken the contract you agreed to when buying the ticket. They can call the cops and have you arrested for accessing the theatre in contravention of the rule. I could see an equivalent for websites that adopt a "no ad blocking" rule.

xodice · on March 4, 2021

Can you cite this actually happening?

If you are asked to leave and do not leave I can see you possibly being arrested for trespassing, but I fail to see what you would be arrested for by bringing in food from outside the theatre. Sure, they could ban you from coming back to property (and this could lead to trespassing arrests, if you did return) but thats about it.

TeMPOraL · on March 4, 2021

I'd expect such websites to communicate it, though, and do the minimum of work to prevent me from entering with ad blocker on. I'm fine with paywalls, and even "disable your ad blocker" walls - they're up-front, and in agreement with spirit of the HTTP protocol.

Much like cinemas. If I try to bring in food I didn't bought at the cinema store, I will be stopped at ticket check and refused entry. Also, the cinema doesn't require me to buy food as a condition for watching the movie. If they did, I'd consider it the price of entry.

What current ad-loaded websites are doing is the equivalent of letting you in to watch a movie, and then having someone come to you as you're watching, and poke you and yell at you because you didn't buy anything at the store.

> They can call the cops and have you arrested for accessing the theatre in contravention of the rule.

I've never heard of a cinema calling cops on people who snuck in food into the theatre. Is that US-specific?

sandworm101 · on March 4, 2021

It only happens when people are discovered, begin arguing, are told to leave, and don't. It's mostly drunk people who have brought alcohol into a movie. Cops remove them as trespassers who refused to leave after being told to.

rattray · on March 3, 2021

I might imagine that Google Inc could sell a product to Mozilla Inc which they could embed in their Firefox browser, enabling them to sell FLoC data to Google. For example.

ogre_codes · on March 3, 2021

> If the browser is supposed to send the information, what are they going to do with those of US who do not use Chrome?

What would stop them from implementing FLoC in the cloud for non-Chrome users? Fundamentally, where the data is stored is meaningless. So long as they are pushing everything through this same "FLoC" model, they are claiming they won't be tracking individuals.

So on Chrome, FLoC is local—your own browser working against you—for Safari and FireFox, FLoC is cloud based.

So long as the web sites in question are running Google Analytics, they still have the software on your system on a huge number of sites.

jefftk · on March 3, 2021

> for Safari and FireFox, FLoC is cloud based

This would require being to identify a user when they are on many different sites (cross-site tracking), which all the major browsers intend to prevent.

(Disclosure: I work at Google, speaking only for myself)

ogre_codes · on March 3, 2021

First, you have to understand that Google as a company has lost all credibility in much of the community. There have been too many issues like this one where Google gives you the impression they are turning off tracking... but doesn’t actually stop tracking you.

https://www.independent.co.uk/life-style/gadgets-and-tech/ne...

There was also the Do not Track setting, which Google ignored. So in my book (and I’m not alone here), anything coming from Google is taken with a huge grain of salt.

For some time, there has been a sort of cat and mouse game between advertisers and browser makers. Do you know Google isn’t using fingerprinting to identify us? Or just IP or some other means?

jefftk · on March 3, 2021

> Google as a company has lost all credibility in much of the community

What I'm saying is Safari [1], Firefox [2], and Chrome [3] are planning technical mitigations that would prevent anyone from implementing a cloud-based FLoC. Even if you don't believe Chrome, it wouldn't be possible in Safari or Firefox.

> Do you know Google isn’t using fingerprinting to identify us? Or just IP or some other means?

https://blog.google/products/ads-commerce/a-more-privacy-fir... has "once third-party cookies are phased out, we will not build alternate identifiers to track individuals as they browse across the web, nor will we use them in our products" and "our web products will be powered by privacy-preserving APIs which prevent individual tracking while still delivering results for advertisers and publishers" which seem pretty clear to me? Additionally, I work in client-side ads infra, and I'm pretty sure I would know if Google were fingerprinting users to target ads.

[1] https://webkit.org/tracking-prevention-policy/

[2] https://wiki.mozilla.org/Security/Anti_tracking_policy#1._Cr...

[3] https://www.chromium.org/Home/chromium-privacy/privacy-sandb...

HDMI_Cable · on March 4, 2021

From what I've read on FLoC (not much so I may be wrong), wouldn't having anonymized ad-targeting be a good outcome? Google and websites still get their ad revenue, and we still get ad-supported products. I guess there's always the possibility of a badly-anonymized algorithm leaking data, but other than that it still seems like a step up from that invasion of privacy being a given, no? If I'm misinterpreting the jist of FLoC, let me know.

toxik · on March 3, 2021

I think an important question with the FLOC stuff is sort of brushed aside.

So, you have millions of users, and you make it "k-anonymous", or "anonymous if you squint real good". Is it really possible to find k such that privacy is meaningfully preserved, given the heaps of other information you have?

Say I belong to some cohort, and you know my IP address. I think this would be enough to fingerprint me reliably, never mind OS version, browser, plugins, etc.

jefftk · on March 3, 2021

> and you know my IP address

https://www.chromium.org/Home/chromium-privacy/privacy-sandb... links to https://github.com/bslassey/ip-blindness for how they intend to handle this.

(Disclosure: I work on ads at Google, speaking only for myself)

sandworm101 · on March 3, 2021

Even if they stay with cohorts, I assume people will be members of a variety of them.

New parent. Lives in Springfield. Works at a power plant. Drives a car. Owns house. Horrible credit rating. Married with 2+ kids. Voted in last election. Ex military. Ex astronaut. Once purchased an effective baldness cure. How many seemingly large cohorts does it take before you are really talking about one clearly identifiable person?

laurent92 · on March 3, 2021

> How many large cohorts does it take before identifying a person

It takes 33 “perfect” yes/no questions to identify uniquely anyone on earth. It is seemingly large since each question splits the world in two.

Of course this is in a perfect world, but I remember seeing quizzes which would guess any object given a few yes/no question. Or perhaps it was people.

retzkek · on March 3, 2021

> Of course this is in a perfect world, but I remember seeing quizzes which would guess any object given a few yes/no question. Or perhaps it was people.

https://en.akinator.com/

bostik · on March 4, 2021

Couldn't get to Dianne Morgan in 40 guesses. The closest it got was to ask in sequence whether the character was British, blonde, an actor and a comedian.

Not many probably know the person in question with her real name - but almost everyone who's ever watched Charlie Brooker's productions should recognise her as Philomena Cunk.

laurent92 · on March 3, 2021

This game is incredible. The question always seem to be unrelated to the person to guess, and suddenly it guessed perfectly.

otabdeveloper4 · on March 4, 2021

Very easy to break if you guess real-world people not famous for acting, sports or politics.

williamscales · on March 3, 2021

On the other hand, I tried to get it to guess Mia Dolan and it took 36 questions, two of which involved Five Nights at Freddies

oblio · on March 3, 2021

It doesn't load the CSS for me, Android/Firefox :-(

ARandomerDude · on March 3, 2021

Does your person have glasses? Is your person male? Does he have facial hair?

At that point the kids usually go for broke and ask me if my player is Susan.

sandworm101 · on March 3, 2021

https://en.wikipedia.org/wiki/Guess_Who%3F

"Each player starts the game with a board that includes cartoon image of 24 people [...] Players alternate asking various yes or no questions to eliminate candidates."

jefftk · on March 3, 2021

> Even if they stay with cohorts, I assume people will be members of a variety of them.

No, users are only in a single cohort: "cohort = await document.interestCohort()" -- https://github.com/WICG/floc

(Disclosure: I work on ads at Google, speaking only for myself)

rattray · on March 3, 2021

Err, what? So there would be one cohort of "new-parent-lives-in-springfield-works-in-power-plant-etc-etc" (combining multiple unrelated interests), it just has to have thousands of people in it?

Do cohorts allow advertisers to say, "Show this ad to anyone in a cohort with an affinity for 'new-parent' over threshold X?" is that so?

Do they let advertisers say, "here are the cohorts of our best 1000 customers, please show ads to anyone else in those cohorts?"

Would people in "my cohort" be the 1000-10000 people most like me in browsing habits in the world?

(Disclaimer, have tried to buy limited quantities of ads in the past, interested in some part as an advertiser)

jefftk · on March 4, 2021

> Would people in "my cohort" be the 1000-10000 people most like me in browsing habits in the world?

Yes: "a browser can group together people with similar browsing habits, so that ad tech companies can observe the habits of large groups instead of the activity of individuals. Ad targeting could then be partly based on what group the person falls into."

> there would be one cohort of "new-parent-lives-in-springfield-works-in-power-plant-etc-etc" (combining multiple unrelated interests), it just has to have thousands of people in it?

I mean, there aren't thousands of new parents who live in Springfield and work in a power plant? You would be in a cohort like "43A7".

> Do cohorts allow advertisers to say, "Show this ad to anyone in a cohort with an affinity for 'new-parent' over threshold X?" is that so?

Not really? An advertiser or (ad tech company) might learn, over time, that people in cohort "43A7" are very likely to be interested in baby clothes while people in cohort "5B7E" are more likely than average but not by that much. So they might be willing to bid more to show baby clothes to someone in 43A7 than in 5B7E, and not bid at all for most other cohorts. But the browser API just gives you an opaque cohort identifier.

> Do they let advertisers say, "here are the cohorts of our best 1000 customers, please show ads to anyone else in those cohorts?"

From my reading of the spec, that seems like it would work well.

rattray · on March 4, 2021

Fascinating... Thank you for digging in on this!

Could a browser vendor send down a mapping of cohort hashes to interest vectors? I don't see anything in https://github.com/WICG/floc that suggests that couldn't happen, and it seems like it could greatly enhance utility without too much sacrifice to privacy. For example, it'd allow an advertiser to target all cohorts who like american football, I'd think.

jefftk · on March 4, 2021

I think this can be built on top of the proposed API? For example, a site about American football could publish/sell the cohort frequencies they observe.

rattray · on March 4, 2021

Ah, that's interesting. Different from what I was talking about but certainly makes sense.

vineyardmike · on March 3, 2021

While I completely agree with your point, and the number required is probably pretty low, the point of FLOC is that no one will need to identify a real person for any “proper” reason.

Advertising rarely is about targeting “John doe” but usually about targeting “[men] && [in Springfield] && [searching for a baldness cures]”.

(I still will do everything in my technical power to disable floc, and still will never trust google). But I do think the implementation removes most of the advertising incentive to track on an individual person level.

sandworm101 · on March 3, 2021

Legitimate advertising. What about the other stuff? What about advertisers who want to swing very specific voters? What about nefarious people who want to put a malware link in front of a specific person. [active military]&&[over 55]&&[lives beside base X]&&[awake before 6am]&&[college education]&&[searched for "retirement planning"] will probably get you the most senior officer at a base/unit. Same too with senior politicians.

vineyardmike · on March 4, 2021

> Legitimate advertising. > What about the other stuff?

Thats what makes FLOC tolerable. The legit stuff is enabled, and it makes it harder to get the other stuff (not impossible of course).

> What about nefarious people who want to put a malware link in front of a specific person.

It'll at least be harder than pre-floc (ideally).

The point is to remove the incentives to collect as much data about an individual (you don't need full profiles of someone) by creating even easier ways to get useful targeting without the direct individual identification.

---

(also, i'm broadly against tracking in general, just to be clear. The point you raised is very valid - esp with existing methods).

kens · on March 3, 2021

Am I the only person who prefers well-targeted ads? If I see a targeted ad for a power supply IC on a random website, sometimes that's interesting and useful. On the other hand, Twitter is terrible at targeting and the ads are just annoying. Half are ads for random "as-seen-on-TV" quality products and the other half inexplicably think I am an oncologist.

logshipper · on March 4, 2021

I suppose there is nothing against well-targeted ads as long as the user has consented to having their data being used for targeted-advertising purposes.

The current landscape, however, does not prioritize user-privacy and serves ads to unsuspecting users who do not always know the terms they agreed to.

Moving away from the present opt-out to an opt-in model would give the user a say in how they would like to be tracked across the web. This is also what Apple is trying to do with its "nutritional-info" (not fanboying Apple here) labels. A user is free to consume the content they would like, but they have a right to know how their data is being used.

phkahler · on March 3, 2021

Would that method benefit from Googles existing people database? I'm thinking it would, and maybe disabling current tracking will kick the ladder out from potential competitors!

Still probably a step in the right direction.

josefx · on March 3, 2021

> that they are not giving people unique identifiers

How far can this data be reversed? If I have a group of ten people can google tell with high reliability that probably three of them are homosexual, two have Alzheimer's and none are pregnant?

comboy · on March 3, 2021

Why is it compared to "random user grouping"? I'm probably misunderstanding something, because it doesn't seem hard to be few hundred percent more accurate than randomness.

danielrhodes · on March 3, 2021

There are probably a few things going on:

* Google sees the writing on the wall: people are becoming more privacy conscious, but more importantly governments in many countries are making motions to suggest this sort of tracking could be regulated in the future. They need to stay ahead of the curve.

* The value of tracking people in this way may be less than everybody seems to think - not all data about a person's activities is equally monetizable. Just because you read an article about Julius Caesar and then looked at some clickbait article about Britney Spears doesn't mean an advertiser can sell you a chariot.

* Google in making the first move can create an advantage for itself in the market

* Much of Google's revenue comes from their own properties -- in the case of search, they are already well positioned to show you relevant ads because you have shown intent. No need for cookies there.

* Google has a dominant position with Chrome and the article suggests they want to do fingerprinting of a person's browsing behavior and put people into different cohorts without sending the raw data to Google. This is similar to how Apple does machine learning on device. If Google has exclusive access to these cohorts, they have a strong incentive to move to a cookie-less model because it puts other ad networks at a disadvantage.

SilverRed · on March 3, 2021

It seems like you could get most of the benefits by just doing content aware adverts? Is the advert showing on a page about guitars? Show an advert about guitars and music accessories. We don't need that advert to follow us around for the next month.

It also completely eliminates the creep factor and conspiracies around adverts. "How did they know I was interested in this, are they listening to my phone?" goes away when its the same as the content you are looking at currently.

fastball · on March 4, 2021

This is what DDG does and they're not exactly rolling in dough.

Also there are a plethora of contexts where this wouldn't work well. For example, Instagram has the best targeted ads I've ever seen – basically every ad I get is for something I would consider buying.

I don't think their ads would work nearly so well if I got an add for a guitar just because my friend tagged her post with #guitar.

throw_awy_1 · on March 4, 2021

That may not maximize ad revenue however. For instance if the Going rate for placing guitar ads is low (not many people bidding to show guitar related ads) but mortgages is quite high google (or someone) would rather show the expensive mortgage ad than the cheap guitar ad.

danielrhodes · on March 3, 2021

Yeah it's unclear to me that having a list of all the pages you've visited (or similar) provides a strong enough signal that advertisers will bid higher or result in greater conversion. In some cases it surely will, but then it becomes a question of what proportion of Google's revenue is dependent on this scenario. If it's not a significant portion, Google might just say the potential regulatory and PR headache is not worth it.

jay_kyburz · on March 4, 2021

I don't agree, I've been searching for a new backpack on my iPad which is too old for an ad blocker. Now every single website, regardless of context, is packed full of adds for backpacks.

A constant bombarding reminder that I need a new backpack. 90 percent of websites I visit are not about backpacks. I'm getting 90% more ads about backpacks.

SilverRed · on March 4, 2021

Thats a possible factor, at the same time, how many times have you looked at something random that you don't intend to buy and then you see adverts for days for this thing you don't want. How much is wasted on this? I'd say that at the moment you are looking at backpack review posts, that's the highest chance of you clicking on and buying a backpack.

0_____0 · on March 3, 2021

not to rain on the parade here but reading the date on a dime puts you well into sub-milliarcsecond territory, well past what is possible to resolve with even a diffraction limited optical light telescope of any kind from space.

Bigger telescope systems can resolve smaller objects, but even if you were to yeet an enormous, optically perfect, 10m telescope into the very lowest earth orbit possible (not maintainable due to atmospheric drag) you would still be a couple orders of magnitude away from being able to read the date on a dime.

bb123 · on March 3, 2021

What does"sub-milliarcsecond territory" mean? I know nothing of satellite tech.

0_____0 · on March 3, 2021

it's an angular measurement. applied here it means the angular resolution of an optical system.

a degree of arc is divided into minutes of arc (MoA)(1/60 degree each), and there are 60 arcseconds in a minute of arc.

Two points that are 1 milliarcsecond apart, projected on a flat surface 200km away, are about 1mm apart on that surface.

edit: (can plug "tan(0.001 arcseconds) * 200km in mm" into google to check this out for yourself)

burnte · on March 3, 2021

I believe his intent, especially with the word "purportedly", was so infer that the proposed system had amazing abilities, less than it was intended to be a perfectly accurate description.

0_____0 · on March 3, 2021

I agree that it wasn't intended to be literal fact, just figured I'd take the opportunity to flex my mediocre optical systems knowledge. :p

jerzmacow · on March 3, 2021

Could you describe what size of optical telescope you'd need to put in space to read the date on a dime? I'm very curious

0_____0 · on March 3, 2021

Check out this chart.

https://en.wikipedia.org/wiki/Diffraction-limited_system#/me...

I would guess that you'd need 0.25mm resolution, which at 200km would require 250microarcseconds angular res, which doesn't even register on that graph at any wavelength or size of telescope lol

edit: this is a very 'spherical cow' analysis. i'm pretty sure that atmospheric effects will kill your ability to image stuff like this well before angular resolution will.

aasasd · on March 4, 2021

That chart is beautiful. While not really Tufte's dream of ‘a ton of measurements in little ink’, that chart shows me everything I'd want to know about this one relation.

Also, perhaps there's a need for a better naming system than one that produces ‘Extremely Large Telescope’.

0_____0 · on March 4, 2021

It's a bit of a joke as well, hence:

https://en.m.wikipedia.org/wiki/Overwhelmingly_Large_Telesco...

titzer · on March 3, 2021

Impressive! I was going to comment going on gut feeling from layman's understanding of resolving power of optics, but glad to see someone who knows more nail it.

KirillPanov · on March 4, 2021

> even if you were to yeet an enormous, optically perfect, 10m telescope

10m is a tiny aperture: https://en.m.wikipedia.org/wiki/Synthetic-aperture_radar

SquareWheel · on March 3, 2021

Having a system that's almost as good but less offensive to some people (the Privacy Sandbox) seems like a reasonable enough justification to change tactics.

Keep in mind that advertisers really don't care what your name is or where you live. They care that you are into action movies and buy fancy shampoos. The other stuff just creates trouble.

insert_coin · on March 3, 2021

> Keep in mind that advertisers really don't care what your name is or where you live. They care that you are into action movies and buy fancy shampoos. The other stuff just creates trouble.

Google cares on their own.

And current advertisers may not care but who knows what kind of technology they may come up with in the future where that data might be valuable, having extra data has never been a problem for google.

notatoad · on March 3, 2021

it's important to remember that google's goal isn't to track you, it's to make money. So finding something that works better isn't necessarily a better way to track you, just a better way to make money.

tracking, including the costs of collecting and storing all that data as well as the PR cost of people being creeped out by all that data you store, is expensive. Given how accurately i'm targeted by most advertising (not very) it seems perfectly reasonable to assume that ads will be just as effective and just as revenue-generating if their tracking becomes a bit less granular.

visarga · on March 4, 2021

> Given how accurately i'm targeted by most advertising (not very)

I think the problem is not ad targeting, but ad inventory. They want to sell things we don't need, that's the real cause of the low efficiency.

They should better focus on things people actually need at the time of search.

notatoad · on March 4, 2021

it's the same end result though - whether they don't have the ad inventory to make use of hyper-detailed tracking, or they're just not very good at turning the tracking into targeting, then either way performing the hyper-detailed tracking is a waste of money and they can make more money by spending less on tracking me.

shadowgovt · on March 3, 2021

> Maybe that just means that they consider their duopoly enough of a barrier that advertisers will be forced to accept less-targeted marketing, but I doubt it...

My guess is that they smell a change in the regulatory wind (EU and possibly US as well) and anticipate that if they get out ahead of it, those regulations will serve as a competitive moat against other advertisers.

That, or their A/B testing has confirmed ad quality isn't remarkably improved over the combination of search-tuned ads, adsense topic ads tuned to the site the user is browsing, and display ads tuned by data the site owner vends to Google about the user to fine-tune ad selection for them. They have enough data to know one way or another, and if it turns out there's not enough value-add to justify retaining all that user browsing state, dropping collation and storage of it can free up room for dozens of new projects.

It's easy to forget that one of the major constraints on what Google can do, at their scale, is storage and processing; they're so big that from most people's vantage points, those resources seem limitless, but inside Google, they know exactly how much they aren't doing because the machines and networking they already own are already pushing petabytes of data around and are already near max peak capacity for their mission-critical applications. Sometimes, the cheapest way to get more space for a new bet or an existing mission-critical task is to just stop doing something else.

usrusr · on March 3, 2021

Or it could mean that they have data showing that all that tracker targeting isn't actually providing the value to advertisers that they claim it provides and want to move on before that becomes common knowledge.

I'd love to see my scenario be true, but I'm afraid that it's much less likely than yours.

sn_master · on March 3, 2021

That's true for the B1 bomber. During the presidential campaign, when Reagan complained about the jobs lost due to its cancellation, Carter arranged for security clearance for him to know about the F-117 and stealth technology which rendered the B1 obsolete. Later on, Reagan restarted the B1 production anyway.

cturner · on March 3, 2021

There is some overlap, but the planes serve different purposes. The B1 is still in service.

marshmallow_12 · on March 3, 2021

heavy bombers reached their ceiling, i think with the b-52. developing new bombers, as far as i can see, is a criminal waste of resources, post Cold War.

dotancohen · on March 3, 2021

The B-52 can be detected with COTS equipment by anyone who can solder and watch a Youtube video. The B-2 could not be detected by the most advanced radar equipment available to the time of its launch, and even today most equipment not fielded by or funded by Russia or the US could not detect it. Your mailbox key literally has a larger RCS than the B-2.

Stratoscope · on March 3, 2021

Forgive my ignorance, but I'm probably not the only one who is wondering what COTS and RCS mean here?

bee_rider · on March 3, 2021

COTS = Commercial off the shelf (stuff you could pick up in a Radioshack before they became mini-Bestbuys, basically).

RCS = Radar Cross-section (a bit more field-specific, more or less how "big" the thing looks on radar, but it is a bit jargony -- it depends on things like the material and radar wavelength).

KMag · on March 3, 2021

COTS -> Commercial off-the-shelf

RCS -> Radar cross section

cjhockey · on March 3, 2021

COTS: Commercial off the shelf, stuff you can buy easily.

RCS: Radar Cross Section - how big of a reflection an object makes at radar wavelengths.

whiw · on March 3, 2021

"Customer Off The Shelf" and "Radar Cross Section"

joshyeager · on March 3, 2021

Commercial Off The Shelf

Radar Cross Section

marshmallow_12 · on March 3, 2021

russia doesn't need bombing anymore. The stealth bomber has had the rug pulled out from beneath its feet. It was effectively obsolete from day one.

babypuncher · on March 3, 2021

Google owns the operating system running on half of all Americans' phones (and an even larger share overseas). On top of that, Chrome OS is continuing to gain market share.

I think this is your answer right there.

LdSGSgvupDV · on March 4, 2021

I think your last paragraph is the answer. Targeting ad with personal information is questionably profitable to me. I am almost not caught by the ads in websites but by some search results (google search first page) or article in forums like here or reddit or some people I followed. Maybe Google just find out they actually do not need such garbage personal information. I said garbage because lots of them are really caused by random events (sometimes you are interested in something. But after that period of time, you would not ever get in touch with it again) or fake information. And, the ecosystem to store these information is expensive.

somedude895 · on March 3, 2021

> There's no possible way that Google is going to make a change to their biggest money maker

To be fair, Google's biggest money maker is their SEA, which doesn't rely on your behavioral data, since user intent is given through a search query.

moksly · on March 4, 2021

In my completely anecdotal experience, Facebook has been far, far, better at targeted advertising than google.

Facebook seems to know what Kickstarter products I’ve backed, what specific miniature paints I like and seemingly which I’ve already bought. Possibly because a lot of my miniature hobby socialising takes place in messenger and on Instagram.

Google by contrast is so bad at advertising, that I often have to re-search on duck duck go to find a webshop that I can actually use post-brexit. Maybe because google knows I don’t speak Spanish, Italian and suck at German? But I mean, I still buy stuff from Spanish websites using the translate function in chrome, so, shouldn’t they know better?

Anyway, for me personal and anecdotal, it seems google is really bad at predicting what I want. I mean, even for content. The front page of YouTube is full of videos I’ve already seen...

laurent92 · on March 3, 2021

Yep, third-party cookies must be very inefficient by now, as well as the js GA script that enables tracking. This is probably the first two things ad-blockers block.

Of course they move away from this, the must have brainstormed and tested new ways for years. OTOH, it is stunning that we’ve invented tech that is stable even when an ad-blocker interferes with the webpage, cookies, js, requests, etc. Try that with binaries! (which is probably where they plan to move the code)

jedimastert · on March 3, 2021

> their biggest money maker

Correction: if you go by the profit reports from Alphabet, Google Display Ads is not their biggest money maker by a long shot. That's still Search Ads

ppod · on March 3, 2021

Point of information: that's a point of information.

jedimastert · on March 3, 2021

You right

oiirewr234324 · on March 4, 2021

> I strongly suspect that they've already figured out something which will work "better" behind the scenes.

Oh stop it with these conspiracies.

Google (like Facebook) has been pushed in this direction for ~2-3 years by changes in Safari to the point that Google has started preparing for this pro-actively

Why do you think Google hasn't complained like FB about the upcoming changes in ios ?

cratermoon · on March 3, 2021

Do you know our Lord and Savior Unified ID? https://www.thetradedesk.com/us/knowledge-center/what-the-te...

oxfordmale · on March 3, 2021

It is known that personally yargeted advertisements are relatively ineffective compared to advertising targeted at a group. For example, advertising for luggage on a travel website tends to lead to higher click through rates rather than showing a personalised advertisement. Personalized ads often try to market products you just have bought.

otabdeveloper4 · on March 4, 2021

(I've worked in adtech for over 15 years.)

Advertising is just applied sociology, and sociology is just applied statistics. So statistically significant and unbiased sampling is what makes advertising work.

Highly-targeted and personalized ads are a crutch due to the fact that unbiased sampling is pretty much impossible on the internet. (There's a joke about this: "an online poll has shown that internet penetration has now reached 100 percent".)

Neither advertisers nor publishers actually want to collect high-resolution data, it's a huge cost with very little benefit.

amelius · on March 3, 2021

> a satellite which could purportedly read the issue date on a dime lying on the sidewalk.

In the future, you don't have to rely on serendipity to find money on the street. You just use a satellite to find it for you.

ChristianGeek · on March 3, 2021

Some of us already use them to find Tupperware in the woods!

vmception · on March 3, 2021

this legend is decades old actually

the US hegemony relies on a perception of omnipotence, so there will always be a lot of boisterous claims

ARandomerDude · on March 3, 2021

Two-service veteran here. In my observation, most people also "learn" about the military from Hollywood, which is almost always comically bad at portraying the kinds of people, cultures, technology, tactics, etc. actually in the military/government. What would 24 have been without Chloe's ability to hack the video camera on a 1980s alarm clock by logging into a GPS satellite?

vmception · on March 3, 2021

Yeah, I feel like there were technological surprises by the federal government up to the 1980s but not really since then, but perception has been riding off of that

Even the NSA presentations and code leaks this century didnt really have surprises more just angst about their lack of accountability

CarelessExpert · on March 3, 2021

> There's no possible way that Google is going to make a change to their biggest money maker that would make them LESS money.

I know this sounds laughable given how toothless regulators have been thus far, but let's not forget that a) fines under GDPR have been growing and are likely to continue to do so, with ad tech in a particularly awkward spot, and b) both the EU and US governments and regulators have started nosing around big tech on a variety of fronts, including privacy and antitrust concerns.

Given that, the cost-benefit analysis becomes pretty complicated. They may feel a shift to a scheme which lowers revenues while avoiding regulatory scrutiny now makes sense.

chiefalchemist · on March 3, 2021

Spot on. They're not going to abandon their core, their bread & butter, their monopoly without a new & improved Plan A.

Full disclosure: I'm reading The Age of Surveillance Capitalism and I get more skeptical (and fearful) with each page turn.

https://www.wnycstudios.org/podcasts/otm/segments/living-und...

samschooler · on March 3, 2021

I feel like the phrase “tracking individual people” is how Google is going to keep tracking you as well as they did before.

With the announcement of federated learning of cohorts [0], it allows Google to use their massive amount of information they have on you already, as well as their dominance in AI to cut out the easy way of tracking people, and make it harder for anyone else to personalize ads.

Don’t get me wrong it’s a good thing that we are removing third-party cookies, but the idea that Google is confident they can track you without them makes me worried for the future of advertising. This move is undoubtedly going to cement google as the best ad tracking company for some time into the future.

[0]: https://www.cnbc.com/2021/01/24/google-extremely-confident-a...

VSerge · on March 3, 2021

With regards to the existing footprint of Google, it's useful to keep in mind that Google and Facebook combined are already catching 74% of advertising dollars spent online, as per the bloomberg article shared here today: https://news.ycombinator.com/item?id=26325950

It's also worth noting that other groups have proposed systems that would not result in Google creating another monopolistic product line. Google did not seem very interested in such alternatives.

nonpolitic · on March 4, 2021

Exactly this. It's Google saying "we don't need third party cookies to track people", and removing them just makes life harder for their competitors.

jsonne · on March 4, 2021

Google isn't the only one doing this though. The Trade Desk and the wider programmatic industry seem to be somewhat pushing to unify around Unified ID 2.0 especially as they released it to the public. (Eg one company isn't the gatekeeper on the cookieless future for ad targeting.

ErikVandeWater · on March 3, 2021

Yeah what is the cohort size and how many cohorts can a person be part of? This sounds like an underhanded way Google can say they don't track individuals while in practice it is almost the same.

jefftk · on March 3, 2021

> what is the cohort size?

"The browser ensures that cohorts are well distributed, so that each represents thousands of people."

> how many cohorts can a person be part of?

It's just:

    cohort = await document.interestCohort();

See https://github.com/WICG/floc

(Disclosure: I work on ads at Google, speaking only for myself.)

josefx · on March 4, 2021

Only thousands? The x-client-data field all google owned servers get from chrome has over 65000 possible values. Wouldn't that make a combination of both unique again?

jefftk · on March 4, 2021

The X-Client-Data header is for evaluating the effect of Chrome changes, not ad targeting:

"Additionally, a subset of low entropy variations are included in network requests sent to Google. The combined state of these variations is non-identifying, since it is based on a 13-bit low entropy value (see above). These are transmitted using the "X-Client-Data" HTTP header, which contains a list of active variations. On Android, this header may include a limited set of external server-side experiments, which may affect the Chrome installation. This header is used to evaluate the effect on Google servers - for example, a networking change may affect YouTube video load speed or an Omnibox ranking update may result in more helpful Google Search results." -- https://www.google.com/chrome/privacy/whitepaper.html

josefx · on March 4, 2021

> The X-Client-Data header is for evaluating the effect of Chrome changes, not ad targeting:

And my personal browsing habits are for me, that doesn't seem to stop Google from inventing new and imaginative ways to use them for ad targeting. The id is also long enough to be unique in a group of thousands, which is a size range in which the groups created by this new way of targeting ads apparently are.

krick · on March 3, 2021

Absolutely, and I'm not even sure it's a good thing that we are removing third-party cookies. I mean, if we take a step back: why should I even hate them? Yes, there are problems with them, there have been exploits using them and all that, but I'm pretty confident there will be similar problems with pretty much anything, because there's always a clever way to abuse some technology that was created with the best intentions in mind. And cookies, at least in theory, mean that you are putting me in charge of whatever tracking token you assigned to me: I can open my browser settings and clear it at any time. It's a good thing.

Of course, there are reasons why they aren't such a good thing as I make them sound to be. And the fact that you don't feel in charge of them should rather emphasize, than negate my point: if such a simple thing turns out to be something you learn to hate, don't you ever dream of something more complicated and less transparent to be your salvation.

And ever since federated learning became "the next big thing" I was arguing here on HN (rather unsuccessfully, I feel) that this doesn't mean that there's suddenly "no tracking", buying into that is just naïve. I'm not directly commenting on FLoC API, both because I don't know it well and because it's usually about making a stronger point anyway: that performing learning client-side and then submitting some sort of "depersonalized data" means better privacy and essentially is having your cake and eating it too. But I think it's just misleading to describe things this way. The idea that "sensitive data" is your document id, your name/surname and such is hopelessly outdated. In fact, it's funny that anyone even believes that, because if you are not a complete bureaucrat, it should be clear that you are not your name or some other sort of ID number, you are your behaviour. It's just that people are used to the idea, that Big Brother needs that simple ID number to keep you in line. But perhaps he doesn't anymore.

What I'm saying is essentially that sending "depersonalized data" is just obfuscating which personal & sensitive data is actually exchanged. And (again, not in the context of FLoC API, but more generally) it can be any data: I think we all remember funny stories how completely impartial GPT-3 learns someone's phone number and such.

So, this, and all of what you already said about the monopolization of people-tracking market.

rektide · on March 3, 2021

> With the announcement of federated learning of cohorts [0], it allows Google to use their massive amount of information they have on you already, as well as their dominance in AI to cut out the easy way of tracking people, and make it harder for anyone else to personalize ads.

yes, yes yes yes. This proposal is that:

* the web have no individual tracking

* the browser, instead uploads our every activity across all sites into the cloud

* the cloud/browser-operators (google, firefox, microsoft, opera, brave) take these troves of big data & apply machine learning to generate cohorts

* the browser then places us into these cohorts

this would, as parent post suggested, absolutely "cement google as the best ad tracking company". it also makes it radically harder to create a new browser. Brave would have to become not just browsing software you run on your computer, but a cloud-service, gathering reams of data like Google, generating cohorts out new big data systems. how is Lynx/elinks supposed to become compliant with this new proposal? how is anyone other than the already existing giants of the world supposed to do this?

bold bold bold plan organize & own all the worlds data here. privacy on the web, but none in the web browser: brave new world Google. you madmen.

qyph · on March 3, 2021

In floc no personal data or web history is uploaded to the cloud. This is the essence of "federated learning" - your browser does the big ml computation, then just uploads the result.

rektide · on March 3, 2021

Where does the data to compute on come from?

Personal information will attempted to be scrubbed, but it very much is an upload of one's detailed web history to the cloud. Google alleges they wont know it's my tracks, but these federated learning systems are all powered by endless reams of data gathered from us.

That the decision of which cohort we are in can be done locally does not change the fact that that data has to come from somewhere, is gathered, en-mass, via huge bulk collections, of very specific, detailed data. It is only weasel words that it is called "not personal data"; our individuality is being harvested, tracked, modelled via these systems. That the information is free of personal identifiers does not make me feel much better about this all.

And it is something few other enterprises will ever have the capability to repeat or compete with. It mandates the web browser be powered by clouds & big data.

throwaway316943 · on March 3, 2021

> It’s difficult to conceive of the internet we know today — with information on every topic, in every language, at the fingertips of billions of people — without advertising as its economic foundation

What’s difficult to conceive is how the currently bloated and monetization heavy internet could survive without advertising. An internet composed of relevant and useful information stored as light weight documents could easily (and used to) exist without advertising. Without ads we probably wouldn’t have millions of identical recipe sites with infuriating backstories, pop ups, share buttons, in-line promotions, etc. but we would still have people sharing their grandmother’s apple pie recipe on personal sites along with a few paid services that add real value instead of SEO garbage and social fluff.

folkrav · on March 3, 2021

> An internet composed of relevant and useful information stored as light weight documents could easily (and used to) exist without advertising.

Any concrete traffic measurements between those sites then and now? I'd be extremely surprised if traffic didn't shoot up by orders of magnitude.

That's also assuming that no ads means we'd go back to the mid-90s internet. Pretty sure there are still companies that would like to do business online, even without targeted ads. The tactics would change, that's it.

> Without ads we probably wouldn’t have millions of identical recipe sites with infuriating backstories, pop ups, share buttons, in-line promotions, etc.

True for pop-ups, in-line promotions or ads. But SEO will be a thing as long as we have search engines. Hell, earlier search engines were getting gamed with keyword stacking all the damn time. Bringing you on the page and showing you ads wouldn't be enough to make money, but I don't see how you'd even get rid of sites wanting traffic. Unless you're really advocating for getting rid of commercially motivated sites - which I'll assume you're not, as it is kind of delusional, IMHO.

> we would still have people sharing their grandmother’s apple pie recipe on personal sites along with a few paid services that add real value instead of SEO garbage and social fluff

That's a whole lot of extrapolation, and a rather subjective judgement call. Hell, one could definitely call this very site "social fluff".

sawmurai · on March 3, 2021

Yes, nothing more frustrating than quickly looking up a recipe detail whilst in the middle of cooking it and having to scroll deep, deeep, deeeeep down the useless SEO text only to scroll past that one tiny detail. My wife and I are creating our own cooking book as a PWA. Only bullet points, no fluff. Short: useful.

oblio · on March 3, 2021

> What’s difficult to conceive is how the currently bloated and monetization heavy internet could survive without advertising.

The no-monetization web ship sailed circa 1995. The internet is about a lot of money and it will be about even more money. If it's not monetized through ads, it will be monetized in other ways (probably regular sales but most likely subscriptions).

throwaway316943 · on March 3, 2021

I should have specified ad monetization or product placement, brand promotion and things in that vein. I’m not against commercializing of the internet but I am against the ad revenue model that heavily distorts the content and presentation. I don’t know what the winning model looks like, lots have been suggested like micropayments, subscriptions, donations, and others more far fetched but none of that can take root until we remove the toxic ad industry.

folkrav · on March 3, 2021

I'm still kind of surprised to see people think of the internet as this static information library it was originally thought as. It hasn't been this way for decades.

cblconfederate · on March 3, 2021

It is also a flat out lie because the "information on every topic, in every language, at the fingertips of billions of people" is not funded by ads, ads fund google, which keeps a 70% cut for themselves. Information is basically impossible to monetize these days unless people charge. Google does not create information, it copies/steals it.

It is difficult to conceive Google without advertising as its economic foudnation.

tehlike · on March 3, 2021

How did you come up with "google keeps 70% to themselves" fact?

Google revenue cut is around 30%, not the other way around.

cblconfederate · on March 3, 2021

https://mediatel.co.uk/news/2016/10/04/where-did-the-money-g...

tehlike · on March 4, 2021

I think this is an anectode.

op03 · on March 3, 2021

Whats odd about the recipe story is my folks have accumulated quite a collection of cookbooks over the years, yet for mysterious reasons both prefer to endlessly scroll through the ad infested nightmare on every cooking site recommended by the Google app. Some of these sites have an Ad after every sentence.

eek04_ · on March 3, 2021

As somebody that has quite a few shelf-meters (shelf-yards) of cookbooks: If I'm looking for a specific recipe, it's still faster to go through the online recipes than getting out the cookbooks and going through the index and finding it. My phone is also more practical to lug around than a cookbook.

op03 · on March 3, 2021

Thats true.

dageshi · on March 3, 2021

Nostalgia is a hell of a drug

sandworm101 · on March 3, 2021

>>Google plans to stop selling ads based on individuals’ browsing across multiple websites >> to target ads without collecting information about individuals from multiple websites

Key words:

Individual: They might still track persons by group, by placing people into pools of like-minded internet users and track the browsing trends of this group.

Browsing: Geographic location, sleep cycles, or media streaming/viewing preferences are still on the table.

Multiple: Tracking within one website is still a thing. YouTube habits are still a go.

Websites: Apps, and everything we do with them, are not "websites".

While I think this is a step in the right direction, I don't see this statement as very limiting. It only addresses a very narrow type of tracking.

sbarre · on March 3, 2021

Yeah at which point does their sub-grouping effectively reduce us back down to identifiable individuals in everything but name?

StevePerkins · on March 3, 2021

> "Instead, Google says its ad-buying tools will use new technologies it has been developing with others in what it calls a “privacy sandbox” to target ads without collecting information about individuals from multiple websites. One such technology analyzes users’ browsing habits on their own devices, and allows advertisers to target aggregated groups of users with similar interests, or “cohorts,” rather than individual users."

I'm having a hard time parsing this out, and seeing what's actually changed. How do they determine an individual's "cohort(s)", without collecting information about that individual across multiple websites?

Is it simply that the data is collected and processed client-side, rather than server-side? Would using a non-Chrome browser effectively opt-out altogether, then? I find this difficult to believe.

cjg · on March 3, 2021

The idea is that they group you with N other similar people where N is large enough to anonymise you enough to stop the worst complaints but small enough that advertisers still find it useful.

Some sort of principal component analysis to determine your "cohort".

An advertiser isn't that interested that you are you. They care that you are, for example, 55 with an interest in gardening and have been looking at lawnmower review sites.

derekp7 · on March 3, 2021

So if I look up the specs on a pair of Bose QC-35's, I won't get internet-stalked with ads for that specific pair of headphones for the next 3 weeks whenever I pull up a weather report? But in stead I may see an uptick in general audiophile ads if I'm constantly reading articles about new gear that comes out? At least that sounds like a partial win in my book -- the first case was too creepy because it was so blatant. The second one may be worse because it is more subtle, but it creeps me out less for some reason. Kind of like how 2-3 decades ago when computer magazines were a thing, I kind of expected ads for computer parts in those magazines.

Edit: Actually, come to think about it, I wonder how people would have felt in the 80's if they got one of several editions of a newspaper based on magazines that they also subscribed to. Households that subscribed to fitness magazines would get a newspaper with more gym membership and weightlifting gear ads, and households that received woodworking magazines would have more Craftsman ads in their newspaper. Would people appreciate the customization of the paper, or would they see it as an invasion of privacy?

jefftk · on March 3, 2021

> if I look up the specs on a pair of Bose QC-35's, I won't get internet-stalked with ads for that specific pair of headphones for the next 3 weeks whenever I pull up a weather report

The WSJ seems to be describing https://github.com/WICG/floc, but the kind of remarketing you're describing is what https://github.com/WICG/turtledove is intended to support. Advertisers would still be able to run that kind of personalized ad, but the browser API would not allow them to learn your browsing history in the process.

(Disclosure: I work on ads and Google, speaking only for myself)

dylan604 · on March 3, 2021

To me, ads following you around the internet about a specific thing that you browsed is just an example of how the targeting ad system is broken. A better ad system would see that you were interested in specific item and then advertise other items that mesh nicely with the use of the specific item. If you bought A, then there's better chances you's be willing to spend money on B too.

throwawayboise · on March 3, 2021

Or you can just use an effective ad blocker (I use uBlock Origin) and not see ads at all. IDK why everyone doesn't do this.

izacus · on March 3, 2021

"Owners of QC-35s" can easily be a cohort, especially since there's a lot of people in that group and it's easy hit the anonymization score required.

mfer · on March 3, 2021

You may still be stalked about the Bose QC-35's. It really depends on how they create the "cohorts". The new design doesn't tell us their intent here. People who are interested in Bose QC-35's could be a "cohort" you are put into.

amelius · on March 3, 2021

It's still bad. If you are in the X cohort and buy Y, the seller will know that you are in the X cohort if the seller targeted X, even if Y has nothing to do with X. This is the fundamental breach of privacy that Google can't get around.

cjg · on March 3, 2021

Unfortunately if enough people have been looking at Bose QC-35s then there's no reason why that couldn't be a fact about your cohort.

Graffur · on March 3, 2021

I thought this was how it already worked..

jefftk · on March 3, 2021

Here's Chrome's technical page for Privacy Sandbox: https://www.chromium.org/Home/chromium-privacy/privacy-sandb...

The bit you're quoting sounds like the WSJ trying to describe https://github.com/WICG/floc. That depends on the browser choosing to support the API, yes.

(Disclosure: I work on ads at Google, speaking only for myself)

mfer · on March 3, 2021

This feels like some slight of hand targeting the current political and legal efforts looking into Google.

For example, if they aren't collecting information on you than how do they have enough information to create cohorts? Is it that they are deeming information processed on your local computer running their software something that isn't them collecting? This seems like a form of misdirection.

Without knowing the technical details (which one never knows with Google), it leave me the impression that they have moved some of their categorization software client side (Chrome, web workers, etc) and are saying that they aren't collecting the data in that case.

Is it Google using remote devices as edge devices to do a bunch of work they'd been doing server side?

kuu · on March 3, 2021

This is probably related to the "ban" of third party cookies [1] and most likely they just found a way without using cookies but other technologies - most likely biometric features + Chrome data in case you're using Chrome

[1] https://www.cookiebot.com/en/google-third-party-cookies/

Edit: I did not have time to read it, but here is the paper and probably something like the algorithm they want to use: https://github.com/google/ads-privacy/blob/master/proposals/...

cblconfederate · on March 4, 2021

So now you, and your intellectual soulmates (whom you haven't met yet) will be targettable as a group. Hmm

marlowe221 · on March 3, 2021

A non-Chrome browser is always a good idea.

rossdavidh · on March 3, 2021

Hypothesis: the real motivation is that Google has determined the following:

1) we can sell all our ads for just as much money anyway, even without targeting based on web browsing, because the advertisers will take what we let them get

2) targeting based on browsing history is more complicated, requiring more developers and server capacity, which Google could certainly afford but it has plenty of other things to do that pay off better

3) it turns out that advertising based on what website you are currently on, works just as well anyway; to put it another way, all that "show a person who just bought a car an ad for another car" targeting was not really working.

"We respect privacy" sounds better than "our targeting accomplished almost nothing and you'll buy the ads anyway."

MetaWhirledPeas · on March 3, 2021

> to put it another way, all that "show a person who just bought a car an ad for another car" targeting was not really working.

Boy that sure is true for me and has been for a long time. I'm often bewildered not by how accurate my ads are, but rather by how braindead they seem to be. I'm not convinced algorithm-based ads work at all.

In fact I would hypothesize that ads are often more effective when they are unexpected, for something the viewer has never considered buying before. A targeted system would hide such ads.

rossdavidh · on March 3, 2021

Excellent point which I had not considered before.

izacus · on March 3, 2021

Well this is kinda obvious - and a good thing I guess? If Google can sell ads without tracking everyone, that seems like the best win? It protects our privacy and removes them from the fight for more tracking in pretty much the easiest way possible.

tehlike · on March 3, 2021

Google doesn't work that way, and in general a corporation pretty much converges to a point where they will put resources on wherever there's dollar.

You are underestimating the big picture, which is - google is really left between apple and regulations (gdpr, ccpa etc) pushing privacy front and google have no choice but to go in this area.

rossdavidh · on March 3, 2021

"...will put resources on wherever there's a dollar."

Yes, but it may be that they've decided there's no dollar here. Or, to put it another way, they get that dollar whether they track your browser history or not, so why bother? It's more like cutting corners, which corporations are most certainly willing to do, but in this case we like it (but that is probably only incidentally of interest to top executives).

_cs2017_ · on March 4, 2021

The best ads I got were on Facebook, and based on something to do with my history, although in what way exactly, I cannot be sure. I got an ad about Triplebyte (got numerous job offers from that); ads for concerts that I was interested in; and ads for art exhibitions very specifically related to my interests (which I would have 100% missed if not for the ad).

So I'm super disappointed that I will not be getting such ads from Google. In fact, it annoys the hell out of me that people who are concerned about privacy created such a strong anti-ad-personalization movement. This harms people like me who prioritize getting valuable information over privacy. I think people should be allowed to enable their own privacy settings, but activists and politicians should not force their ideology on the society to such an extent as to scare corporations away from personalization.

Though to be fair, I don't recall a single useful ad from Google in my life. So maybe I overestimate the damage the privacy advocates are causing.

jqpabc123 · on March 3, 2021

So one of the biggest privacy invaders on the planet is charting a course towards "privacy first"?

If you believe this, maybe you would like to buy a bridge?

domnomnom · on March 4, 2021

A lot of people work at google. A subset of googlers are very focused on privacy first.

hobofan · on March 3, 2021

- Yeah, I'll believe it when I see it

- It also doesn't seem too unlikely, given more and more talk of the possibility that "targeted" ads don't actually work that much better than non-targeted ones, and Google are in the best position to verify that. Switching to ad models that are not based on exact user tracking, as long as they have the monopoly position might be the best way to ensure future profitability before they lose their users due to privacy concerns. All the AI and other technical and other monoplistic (AMP, Maps, etc.) advantages might be good enough to beat any tracking-based competitiors.

jqpabc123 · on March 3, 2021

... possibility that "targeted" ads don't actually work that much better than non-targeted ones

They don't and here is why --- if you search but don't purchase through a Google ad, you're likely to still see ads for the item following you all across the internet for months after you made a purchase elsewhere.

This does nothing but annoy the user.

drewcoo · on March 3, 2021

Oh no, no, no. It's "more privacy first." And that may not be "privacy first" at all.

jqpabc123 · on March 3, 2021

"Privacy first" is Google-speak meaning "the first thing we do is eliminate your privacy".

sagivo · on March 3, 2021

It's a good time to pivot. Google is one ad-block away from loosing 70% of their revenue [1]. They should adopt the model that DuckDuckGo uses [2] and offer ads based on search query vs user data tracking.

[1] https://www.statista.com/statistics/266471/distribution-of-g...

[2] https://spreadprivacy.com/duckduckgo-revenue-model/

api · on March 3, 2021

They should also offer a paid search engine with more advanced features and better results. I would pay for that if the cost were reasonable.

arcturus17 · on March 3, 2021

You mean one where SEO-driven garbage content didn't flood the ten first pages of results? Isn't that a lost battle, ie, wouldn't the same guerilla army of SEO marketers who've driven things to where they are right now find ways to pollute that too?

skinkestek · on March 3, 2021

I used to keep a txt file containing something like:

-site:highrankingspamsite1.com -site:highrankingspamsite2.com etc

And then I would paste that in at the end of each query.

Today they've removed that kind of spam so I don't need that.

Today my problem is that they fuzz my queries to include useless results. Not a spam problem rather than a problem of Google not realizing that I only want relevant results.

But of course, removing results that doesn't contain what I search for is easy and not an interesting machine learning problem so why should anyone care to fix that?

(That said: lately things have improved, so maybe I should just shut up and hope nobody at Google notices and puts back the insane fuzzing.)

oblio · on March 3, 2021

> Today they've removed that kind of spam so I don't need that.

Try looking for images and see how many results in the first "pages" are not from Pinterest.

eek04_ · on March 3, 2021

If you want to avoid the term expansion, just put double quotes around the words you want to hard require.

As for including "useless" results - that's never a goal, the goal is always to rank the results in the best possible way. What's the best possible way varies depending on the user, and relatively large term expansion is useful for certain users in certain cases.

skinkestek · on March 3, 2021

> If you want to avoid the term expansion, just put double quotes around the words you want to hard require.

Lucky you if that consequently works for you. It is getting better lately, but from 2009 to 2019 they've more or less consequently ignored both dpuble quotes and their own verbatim option.

BTW: the way you write make it sound like you work in search. Is that correct?

eek04_ · on March 4, 2021

I don't, but I collaborate with people that work in search, and regularly hear detailed presentations about aspects of it.

girvo · on March 3, 2021

Which is fine, except even that seems to be ignored sometimes.

api · on March 3, 2021

A paid search engine could have advanced features like "exclude this site from future results" or "don't show me content like this" that used semantic AI profiling. It could leverage more CPU-intensive and data-intensive AI features because you are paying for it so the economics work.

Free search incentivizes the cheapest (in CPU and other resource terms) search possible and monetizing in ways that add the least actual value.

petepete · on March 3, 2021

I'd pay for the old Google Image Search and the ability to blacklist Pinterest.

chrisshroba · on March 8, 2021

You can pretty easily implement this! There are a few Chrome/Firefox extensions that modify Google Image search to look like it used to, and as far as blacklisting Pinterest, I'd recommend creating a custom search engine in Chrome (I assume this is possible in ff too?) with that parameter as a part.

Settings > Search Engines > Manage Search Engines > Add

Then input:

- Search Engine: Google Images

- Keyword: i

- URL: https://www.google.com/search?tbm=isch&q=%s%20-site:pinteres...

Then typing i<space> in the url bar will use that search engine. I've got about 30 of these for various sites and they're a great time saver! (Chicago Craigslist, Google Maps Directions to <place>, DuckDuckGo I'm Feeling Lucky, Wikipedia search, etc.)

kennyadam · on March 3, 2021

I pay for Google One and would love it if they included no ads in with that. I bet a lot more people would take it up too.

s3tz · on March 3, 2021

A drag and drop filter builder alone and I'd fork over a subscription.

supermatt · on March 3, 2021

Id say the title is incorrect, as its still selling ads based on your browsing - its just they arent directly capturing the addresses - instead using some ML for cohort analysis via their "Privacy Sandbox".

It seems to me that this is actually giving them more reach, as now it wont just be websites using google analytics, etc - but EVERY website you visit, unless the site owner specifically opts out - a google-made blackbox that sees EVERY site you visit and labels you accordingly

Few questions for the knowledgeable:

  - Is it be possible to disable?
  - Will they be releasing the datasets for how these ML models are constructed?
  - Do sites in "incogito mode" contribute to analysis?

jefftk · on March 3, 2021

See https://github.com/WICG/floc

> Is it be possible to disable?

"A site should be able to declare that it does not want to be included in the user's list of sites for cohort calculation. This can be accomplished via a new interest-cohort permissions policy." For a per-user opt out, a browser extension could easily block it.

> Will they be releasing the datasets for how these ML models are constructed?

"The browser uses machine learning algorithms to develop a cohort based on the sites that an individual visits. The algorithms might be based on the URLs of the visited sites, on the content of those pages, or other factors. The central idea is that these input features to the algorithm, including the web history, are kept local on the browser and are not uploaded elsewhere — the browser only exposes the generated cohort."

This code will be in the browser, which is open source.

> Do sites in "incogito mode" contribute to analysis?

"All sites with publicly routable IP addresses that the user visits when not in incognito mode will be included in the POC cohort calculation."

(Disclosure: I work on ads at Google, speaking only for myself)

supermatt · on March 4, 2021

So to reiterate:

> Is it be possible to disable?

No. You suggest this is possible with an extension, but it clearly wouldn't be permitted in chrome store (as per well known ad-blockers). Why wouldn't it be a setting in the browser? Clearly anti-user practices.

> Will they be releasing the datasets for how these ML models are constructed?

No. You suggest that the code is in the browser (which i doubt - its certainly not at the moment!). Even if it were it is effectively useless without the training datasets. i.e. its a blackbox.

> Do sites in "incogito mode" contribute to analysis?

No (at least this one is a positive "No") :)