How SEO Ruined the Internet (superhighway98.com)
479 points by midef on April 6, 2020 | 286 comments



It kind of ruined content too.

Why did it ruin content? You are not the only one searching for the answer to that question. Keep reading to learn why SEO ruined content.

Many people think that SEO ruined content. In this post, we are going to explain why SEO ruined content. When you finish reading this post, you will know why SEO ruined content.

In recent years we have observed a growth in the quantity of content created; unfortunately, as we are going to explain in a moment, it has been ruined by SEO.

Is SEO really the reason content was ruined?

Some people argue that SEO is not really the reason content was ruined. We will review all the reasons why SEO could really be ruining content.

Please, click "next" to learn why SEO could be ruining content.


I just searched "SEO ruined content" on DuckDuckGo and this Hacker News page is the first entry.

We are very lucky that, as far as we know, an accumulation of irony doesn't create black holes.

https://duckduckgo.com/?q=SEO+ruined+content&t=canonical&ia=...


HA! People collect significant paychecks for what you just did. I love this so much.


Currently page 2 on Google.

To be fair, "SEO ruined content" is a pretty specific search string and doesn't even show up in the results that out-rank this submission. This comment thread specifically talks about "SEO ruined content" and is, correctly, a strong candidate result.


>Currently page 2 on Google.

And now on first page


5th result.


It's the first one for me now.


Likewise.


Very specific indeed, no data on Ahrefs for this keyword.

Keyword Difficulty < 15

Ahrefs estimates that you'll need backlinks from ~17 websites to rank in the top 10 for this keyword.


First result on Google now. Well done!


Even above the article that we're discussing. Hilarious. And a very good demonstration of the problem.


Ok, now that we agree this is neat (I am impressed!), what’s the solution?

Google or not, what can help me discover the right recipe, the right content, the right answer to my coding error?

Search algorithms that keep “mutating” even randomly? Ranking system? What?


I fear it's incredibly hard to fight fake content with automated systems. You need the ability to understand the content to determine whether the content is actually meaningful. For that, you need far better intelligence than just some fancy text processing.

You need people, basically. If you want to automate it, you need some way to figure out what people think about that content. This has been tried by counting how many people link to it, or rate it highly, but those methods have also been gamed to death.
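The link-counting idea mentioned here is essentially PageRank, and it is easy to see why it is gameable. A toy power-iteration sketch in Python (the link graph, including the two-page "link farm", is entirely made up for illustration):

```python
# Toy PageRank by power iteration. Every page starts with equal rank;
# each round, pages pass a damped share of their rank to the pages they
# link to. The graph below is invented; "spam"/"spam2" form a link farm.
links = {
    "a": ["b", "c"],
    "b": ["c"],
    "c": ["a"],
    "spam": ["spam2"],
    "spam2": ["spam"],
}

def pagerank(links, damping=0.85, iterations=50):
    pages = list(links)
    rank = {p: 1.0 / len(pages) for p in pages}
    for _ in range(iterations):
        new = {p: (1 - damping) / len(pages) for p in pages}
        for page, outlinks in links.items():
            share = rank[page] / len(outlinks)
            for target in outlinks:
                new[target] += damping * share
        rank = new
    return rank

ranks = pagerank(links)
```

The farm pages keep recirculating rank to each other and end up out-ranking a legitimately-linked page, which is exactly the kind of gaming described above; real engines pile countermeasures on top of this.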


Awesome.


Neat!


Yep.

Say I want a recipe. A tried and true delicious recipe. Can I search and just find a recipe? Nope. Through the magic of SEO, I now have to scroll through 15 paragraphs of somebody's life story before being able to examine the time and ingredients.

How much time and energy was wasted on building "tag" systems? All those fun little term link clouds that sites used to have. I know I wasted time on it. I had something that would scan for words and their synonyms and tag articles, a rescan feature for tags that got added after the fact, and various other utilities.
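The kind of word-and-synonym tagger described above might have looked something like this minimal sketch (the synonym table is invented purely for illustration):

```python
# Minimal article tagger: scan words, map them (and hand-picked
# synonyms) to tags. The synonym table is invented for illustration.
SYNONYMS = {
    "recipe": "cooking", "baking": "cooking", "flour": "cooking",
    "python": "programming", "compiler": "programming",
}

def tag_article(text):
    tags = set()
    for word in text.lower().split():
        word = word.strip(".,!?;:\"'()")
        if word in SYNONYMS:
            tags.add(SYNONYMS[word])
    return sorted(tags)

tag_article("My favorite recipe uses flour and sugar.")  # -> ["cooking"]
```

A "rescan" feature for tags added after the fact is then just re-running `tag_article` over the stored articles with the updated table.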


Filler content is the cost of ad-supported content sites.

Pay for a Cook's Illustrated or NYT Cooking subscription and all these problems go away.


And if you want it all in one place, an entire compendium of tried and tested recipes, pick up a copy of one of Cook's Illustrated's New Best Recipe books, even an older one.

I have a copy from probably 5 years ago that seems to be out of print and can be had for $7 used. It's 900+ pages of well-tested recipes where each one has a narrative about how they tested and arrived at the final recipe.


The website is searchable and has videos. :-D

Cook's Illustrated is amazing; the rigor they go through testing everything, and adapting to available ingredients, makes for such incredible reliability!


I have never understood what the filler content on recipe pages is accomplishing though. It somehow makes the page do better in google? How?


Plain and simple, Google won't rank you without it. Google looks for additional filler when ranking. Otherwise you would see different kinds of sites.


Then we have to blame Google, not the authors.


Right, but is there any explanation for why a recipe page with several pages of filler content ranks better on Google though?

I assume (?) it is not google engineers tuning for this particular outcome on purpose...

Not all that plain and simple.


More space for ads.


Or you could buy a cookbook. There's a bunch of them out there. Some are good, some are bad, some are long-lived classics, some are deep dives into a particular cuisine by someone who knows it inside out.

You'll still have to suffer through a certain amount of SEO bullshit, you probably don't wanna just go to Amazon and try buying the best-selling cookbook in whatever cuisine you're interested in because that's got its own Amazon-specific SEO clogging things up, but...


My local libraries always have donated cookbooks for sale for 25 cents to a dollar. What is surprising is their number and variety.


Books? You mean those old-fashioned things made out of paper with black marks on them?

Most are black and white only and don't even have pictures.

Grandpa used to read that.


Yes! Books. Most cookbooks have a lot of full color pictures. They also lack ads, if they have a story to give context to the writer’s relationship with the cuisine it will be a few paragraphs introducing the whole book rather than at the head of every recipe. They do not have any DRM; once you buy one it is yours to keep, give, or loan as you will.

If one gets wet (not at all inconceivable for something used in the kitchen) you merely have to dry it out. It will be slightly wrinkly but it will work fine, unlike a wet phone/tablet/laptop. It’ll have the dish you’re interested in plus a whole lot more, you probably won’t want to make every single one of them but there’s probably gonna be a few that look worth trying.

It will not track you. It is easy to store where it is needed. If you want to make notes - maybe you loved this dish and hated that one, maybe you made a variant that your SO loved - it’s easy to make them with a pen or pencil, and have them visible without any extra interaction. It’s a pretty useful technology!


I find this particular example particularly gross. Recipes are no doubt a heavily searched category. Why does Google allow pure CPM-hacking garbage websites to win the top spots? Does it have to do with the Google ads from top to bottom?


> Why does Google allow pure CPM-hacking garbage websites to win the top spots?

Something bothered me about this question, and I think it's the way it frames Google's role as being a passive participant.

Google doesn't "allow" anything. Google writes the rules and picks the winners.

When you search for "chocolate chip cookie recipe," Google's search algorithm goes "Here's a nice webpage with Grandma Betty's life story and a paragraph about how to make chocolate chip cookies at the bottom. This is what you were looking for."

Recipe sites look like they do because Google forces them to look like that if they want Google to send them any search traffic.

Is there a different algorithm that would give more useful results? Is there a way to rank the sites on how well they present the information you were searching for? Is there a way to factor in whether a site has good recipes or terrible ones? I don't know, but I don't have a giant advertising money fountain and teams of very well paid engineers.

Like you hinted at, I think it's reasonable to suspect that Google has no incentive to fix this. They get their ad money either way, and they probably get more of it from worse sites. As long as it's good enough to keep people from switching to other search engines en masse, they're not losing anything.


There's also manual tweaking. It's why known scam phishing sites and DMCA takedowns don't "win".

It's not just some simple, disinterested mathematical orchestration without any human engagement or other layer on top.


A long time ago, having your site in a high-quality curated directory like DMOZ boosted your search rank a lot.

I was an editor for a few categories at DMOZ. Not only did I allow only good content into our categories, but I checked older approvals from time to time to see if they behaved. I had to delist some websites that thought they could trick us.


Seems like it’d be fairly straightforward to identify a post by genre and then weight for terseness.


Even worse when you do find the recipe it uses some crazy measuring system!

What is one "cup" of flour? It obviously depends on how fine the flour is. Use grams! Use ounces! But above all use some kind of real measurement!


> What is one "cup" of flour?

8 fl oz of flour spooned into the measuring cup or 4.5 oz. by weight.

I'm not at all trying to be snarky here. Like any other specialization, people who cook have organic terminology that is useful to the in-group but confusing to the out-group. A /24 isn't the same as a class C network, but we all know what is being conveyed.

And you'll get much more accurate measurements of small quantities by measuring by volume instead of weight, since kitchen scales aren't that accurate. Knowing that 0.25 tsp is about a gram for just about any granular thing will probably do better than your scale.

It also helps convey the significant figures in a recipe. Very few recipes have a tolerance of 1 g. Even finicky bread doesn't get that accurate, so it's ridiculous when some bloggers write something like 113 g when, without question, the recipe was originally formulated to be 4.0 oz and you just messed up the conveyed tolerances by blindly converting.
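Worked out in code, the figures from this thread (1 US cup of flour taken as 4.5 oz by weight; the avoirdupois ounce is 28.3495 g) show exactly where the over-precise 113 g comes from:

```python
# Flour conversions using the figures from the thread:
# 1 US cup of flour ~ 4.5 oz by weight; 1 oz = 28.3495 g.
GRAMS_PER_OZ = 28.3495

def cups_flour_to_grams(cups, oz_per_cup=4.5):
    return cups * oz_per_cup * GRAMS_PER_OZ

one_cup = cups_flour_to_grams(1)    # about 127.6 g
# Blindly converting a recipe's 4.0 oz yields the spuriously precise 113 g:
blind = round(4.0 * GRAMS_PER_OZ)   # 113
```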


I understand how to make bread, baker's percentages, etc. It's just that when it comes to flour, a cup is a terrible measurement, due to the way it packs and the different volumes involved in various varieties/types of flour.

Most of the time when I'm cooking and I see an American recipe I just google the conversion; a cup of milk is easy to deal with. Just a minor irritation most of the time, but for some things it matters.


Yeah, cups of flour annoy me to no end. The difference between a cup of packed down flour in the bag vs flour that's been sifted is something like 30% by my rough estimate.


Psst, dude, I recently found out there are conversion tables for Freedom Units!


Most Americans don't have a kitchen scale.


I know that different cultures are different, but I always think of Americans as being (kitchen) gadget-obsessed.

Maybe I've watched too many infomercials and soaps. I've reached a point where I know Americans don't have (electric) kettles, but a scale seems like a necessity for anybody who cooks.

(I guess there are a lot of people, American or otherwise, who just don't cook. So I'd understand in that case. But cooking without a scale just seems surprising. Even where I come from, I grew up with a balance scale with brass weights. It was never hugely accurate, but always available.)


It's because Americans mainly cook with volumes ("1 cup flour") and not masses ("100g flour"). Just look at any American recipe book - it will use volumes instead of weights.


Wait, Canadian here. Americans don't have electric kettles?


Apparently they're unusual and not at all common.

People seem to prefer the kind of old-fashioned manual kettles you place on top of a stove.


It lets you know whether the recipe is any good.

You're searching for a bibimbap recipe, their page is split into Hangul and English versions, and this is the recipe their grandma uses? You bet your ass it's going to be good.

You find something on allrecipes.com, how do you know if this is a good recipe? Only if you already know it is.


This is so true; every time I look for a recipe online, it's all I ever see.


Don't look for crafting-ish instructions... the amount of outright false farmed content is astonishing.

"Here is a stock photo of something neat; we're going to teach you how to make it!" {4 unrelated steps} {stock photo again} "Tada!"


And it's surprising how many sites that are half-decent on their own, if you visit them without an ad blocker, still have Taboola ads about how a person in {your location} got rich in a way they don't want you to know.


There's a Chrome extension to fix that: https://github.com/sean-public/RecipeFilter

> This Chrome browser extension helps cut through to the chase when browsing food blogs. It is born out of my frustration in having to scroll through a prolix life story before getting to the recipe card that I really want to check out.
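The extension works by spotting the recipe card in the rendered page, but a related trick is worth noting: many food blogs embed their recipe as schema.org JSON-LD for search-engine rich snippets, and that block can be pulled out directly. A stdlib-only Python sketch (the HTML below is a made-up example, and real pages often nest the data more deeply, e.g. inside `@graph` arrays):

```python
import json
import re

# A made-up page: filler prose plus an embedded schema.org Recipe block,
# the structured data many food blogs include for rich search results.
html = '''<html><body><p>Long life story...</p>
<script type="application/ld+json">
{"@type": "Recipe", "name": "Peanut Butter Cookies",
 "recipeIngredient": ["1 cup peanut butter", "1 cup sugar", "1 egg"]}
</script></body></html>'''

def extract_recipe(page):
    """Return the first JSON-LD block whose @type is Recipe, else None."""
    pattern = r'<script type="application/ld\+json">(.*?)</script>'
    for match in re.findall(pattern, page, re.DOTALL):
        data = json.loads(match)
        if data.get("@type") == "Recipe":
            return data
    return None

recipe = extract_recipe(html)
```

This skips the life story entirely, which is roughly what the extension does client-side with CSS selectors.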


Glad to hear this exists, but is there anything like it for Firefox?


Most Chrome extensions can now be run on Firefox without alterations.

https://addons.mozilla.org/en-US/firefox/addon/recipe-filter...


I tried loading the source as a temporary add-on, it looks like this is not one of them.


Did you see that I linked to the Firefox version of this extension?


I cannot say this enough: buy a recipe book.

Online recipe sites will always be absurdly verbose, because if they just posted a list, Google would either rank it poorly because it is "thin content", or scrape it and post it as a Featured Snippet. There is seemingly no in-between.

The site authors have no choice but to fill it with a bunch of warbling about their childhood, to meet Google's minimum word count, which of course the company swears doesn't exist.


Recipe books have a different problem: they need to fill 100 or so pages of content, and most authors have maybe 3-5 good recipes. So you get a ton of filler as well.


See, there is a double-edged sword here, because outstanding recipes are like thesis papers, explaining the motivation, concepts, prior art and research, history, experiments, methodology, equipment, and then finally, optionally even, the recipe. They should be long and verbose. You should learn something by reading a recipe.

The problem, of course, is that search crawlers can't tell the difference between the two, and so long-but-empty content is treated the same.


Recipes have become hilarious as they have transitioned into storytelling pieces for whoever is trying to promote their lifestyle blog.


This is because of a quirk of copyright; in the US at least, a recipe itself is not considered copyrightable if it is a mere listing of ingredients; however, they can be covered by copyright when they have "substantial literary expression – a description, explanation, or illustration, for example – that accompanies a recipe or formula…”.

If you want some recipes that don't have all that nonsense associated, I'd recommend the BBC's Food site[0].

[0]: https://www.bbc.co.uk/food/recipes


Which for some reason competes with another BBC website, https://www.bbcgoodfood.com/


I skip straight to "print recipe". The whole story is for SEO, and I honestly don't think anyone ever reads it.


My wife recently sent me a recipe that had a link right at the top: "Jump to recipe". It was amazing :)


This reads a lot like Webb and Mitchell's gift shop skit. https://youtu.be/7MFtl2XXnUc


I like your good article about content click here to see how I think about good content also


This, x1000! By pandering to Google's algorithm, a lot of great content has become watered down, softened, or impacted in other ways. Titles have to be worded a certain way, articles have to be a certain length...

hoping someone else chimes in with other thoughts because I used to know more about this, and SEO cheapening quality content was one of the key takeaways I had.


Recipe blogs are the worst! Just get to the goods already!


Years ago when I was a child growing up in the icy woods of Alaska, my parents ...

... And that was when we realized that the kindling was all damp. What to do! Four hours we ...

... it reminded of our travels in Italy ...

... And that brings us to our famous three-ingredient peanut butter cookie recipe:


I once came across a "world's most amazing curry recipe" that had two ingredients; rice, and a jar of storebought curry sauce. It was actually kind of tragic.


Google has transformed any webmaster into a writer.

After a few years of writing recipes, one can transition to screenplays and novels. :)


It's like we've flipped APIs on their heads - humans use tools to obtain the minimal data (just the recipe), while the bots make requests and receive the whole dump of pointless, extraneous fluff in return.


And here comes the business trick: API access is paid and restricted and requires you to enter a relationship with the data provider. Websites are purposefully made machine-unreadable, because otherwise a lot of people would use third-party scripts to avoid viewing ads and filler content.

The Internet is the exact opposite of what it was meant to be. Instead of giving efficient access to high-quality information, it gives inefficient access to watered-down, spread-thin, low-quality content. Site after site, query after query, we all continuously pay a little tax with our time.


> Websites are purposefully made machine-unreadable.

This is a sad result of incentives which ruined RSS, semantic web, and the original purpose of the Internet: to effectively share and communicate information globally.

Advertising (including propaganda and surveillance) and copyright are major forces in that ruination - not to say they don't have legitimate uses and benefits, but their concentrated financial interests conquered the domain. It's a kind of colonialism of public (cyber)space and collective mind.

Tax is an apt analogy, since the power lies in the control of "media" in the most general sense, to force itself in every transaction between things and people.

I believe "dis-intermediation" will help improve the free and direct flow of information. That is, to reduce the middle layers - the gate keepers, the walled gardens, intercepts (lawful or not), filters - by re-decentralizing the net.


This only really applies to the commercially-focused part of the Internet, though. Many Internet sites are still hosting academically-focused, non-commercial or broadly pro-social efforts, and those seem to be among the quickest adopters of Linked Open Data standards which are precisely meant to make content machine-readable, sometimes with standardized and freely available API-like endpoints for querying the underlying datastore.


That is because of the ad revenue model. Not many people would pay for a subscription, and I don't know of other working revenue models.


More people would be willing to pay for subscriptions, directly or indirectly, if the ad revenue model didn't exist. Otherwise, "free with ads" outcompetes all. That's why I'm in favor of banning "free with ads" wholesale.


I am not on high moral ground to talk about ads. I am a web developer, yet I use uBlock Origin.

Now that I have some money, I am willing to pay for subscriptions, because that should mean high-quality content, not ads. However, there were times in my life when I barely had enough to eat and could barely afford my Internet connection, and paying for subscriptions was the last thing I was willing to do.

I suppose many of the 7 billion people living on Earth are poor enough that they cannot afford subscriptions.


A lot of the revenue models suck: if you pay, they just put your name in a database, sell it to everyone, still show you ads, and make it impossible to cancel. I wanted to buy Cook's Illustrated, for example, but after reading about their business practices I chose not to.


Many now have an anchor directly to the recipe. I try to bookmark those so I don't have to read the author pontificating on the changing of the seasons, or on how an ingredient in the dish reminds them of childhood, for the 800th time.


There's a Chrome extension to fix that: https://github.com/sean-public/RecipeFilter

> This Chrome browser extension helps cut through to the chase when browsing food blogs. It is born out of my frustration in having to scroll through a prolix life story before getting to the recipe card that I really want to check out.


If I never have to read another article with "XYZ Is Happening. Here's What It Means" or anything to that effect as a headline, it will be too soon. SEO has created lazy writing, and as a former journalist, I hate what it has done to my one-time profession.

I can also do without sites that think they're being quirky with their newsletter popups. "Click here to subscribe" or "I'm not interested in staying informed/saving money/being in shape". Stop. You're trying too hard.


Thanks, I hate it


Every time I land on a website with such content, I wonder why anybody would want that kind of traffic.

I mean, don't they have server costs? It's like they own an infinite bucket of money that constantly gets refilled somehow.

Why are advertisers even paying them for such impressions? Are they so ridiculously careless with their business model? Could we all hurt advertisers by setting up a website like this?


I know little about the ad business but the insinuation I've heard is that the advertisers don't want those impressions, but struggle to block these sites for one reason or another. In other words, it's basically a scam.


I think most advertisers aren't even aware, the ad platforms are so full of dark patterns it's hard to avoid this stuff.


That traffic is monetized in a few ways:

- Autoplaying video ads

- Affiliate marketing (meal kits, cooking courses, etc)

- Adsense display ads

- Taboola/Outbrain

These don't bring in much money on a per-site basis, but the owners usually own multiple sites, so it can add up.

If you're able to get the site to grow, it becomes the easiest digital property in the world to maintain. The content doesn't need to be fresh, or engaging. People will always search for generic recipes.


It’s all very parasitic and relies on a lot of disconnect. The advertisers often don’t even know what they are buying, as these things go through “exchanges” that are basically computers buying and selling impressions like real estate. The smaller the business, the more likely they are to end up scammed by this, because they have fewer tools to monitor things.


Seems to me like you have it backwards.

The reason you even got a chance to click on it is because they wrote it that way. If they hadn't, it wouldn't be as visible on Google, so nobody would see it. When most websites do this (to survive), folks don't have a choice.

Also, advertisers don't pay for something that's invisible.


Basing your income and your future on something as volatile as Google search seems to me a terrible idea.

What if an algorithm change tanks the whole fleet of poor-content websites you've spent the last few years on?


Then you shut down the servers and move on. You got the ad dollars at that point, what do you care?


That reminds me of an ad network, usually displayed next to webcomics, with nearly all the ads pointing to other webcomics, which all had ads from the same network… I had no idea how that network made any money or paid anything to the websites.


Perhaps you're thinking of Hiveworks? They're less an ad network and more like a publishing company for webcomics. Many of those types of "webcomic ad-rings" are.


We have the lyrics to the sad song you're singing right now.

That is, we will have them, once someone adds them.


Deary me.

I wish I could be around to see what future historians / archaeologists make of this.


This is especially prevalent for simple questions that would normally have simple answers.

I'd like to know when to water my mint plant without reading about the whole history of mint.


Hilarious... reminds me of my favorite Mitchell/Webb parody: https://www.youtube.com/watch?v=7MFtl2XXnUc


The moment when Google turned to the dark side was in 2005-2006, when they stopped sponsoring the "Web Spam Squashing Summit" and started sponsoring SEO conventions.

"There's going to be a Web Spam Squashing Summit next week: Thursday, Feb 24th. (2005). Technorati is organizing the event (thanks guys!) and we're hosting it on-site at Yahoo in Sunnyvale. The main goal to get the tool makers in a room together to talk about web spam, share info, and brainstorm. So far AOL, Google, MSG, Six Apart, Technorati, and Yahoo are on board. I hope we'll also have representation from Feedster, WordPress (hi Matt), and Ask Jeeves and/or Bloglines too."[1]

The next year, in 2006, Eric Schmidt, Google CEO, addressed the Search Engine Strategies conference.[2]

"The search advertising market – a tremendous credit to you and to the organization that built this conference..."

And that's when Google turned evil. From trying to stop search spam, to promoting it.

[1] http://jeremy.zawodny.com/blog/archives/004256.html

[2] https://www.google.com/press/podium/ses2006.html


I can tell you as an in-the-trenches SEO that Google is 100% not in cahoots with SEOs in any way.

Google WILL bend over backwards for paid search agencies that direct their clients' ad spend through AdWords - but this is not SEO. These paid search agencies that funnel millions to AdWords will get invited to Mountain View, get paid lunches, visits from Googlers, etc. - but this is SEM, not SEO.

What does Google do for SEOs? They have a handful of ambassadors that answer questions on a weekly basis. While they are generous with their time (https://twitter.com/JohnMu), they keep things very close to the chest regarding the details of their algorithms. They will send some speakers to some SEO conferences, but will rarely if ever sponsor an SEO conference unless it's a part of a broader paid advertising or marketing tech conference.

Google's Algorithm updates - https://searchengineland.com/library/google/google-algorithm... - rolling out every month or so are notoriously a black box and frustrate SEOs to no end!


You base your judgement on the legal and public communication between Google and SEOs.

Everyone else is basing their judgement on the effect Google has on the Web.


I totally agree. Google has become useless for about half my searches. It gives me only the biggest, most commercial or most popular results. Anything obscure is impossible to find.

I'd like to have a search engine where you get only the most obscure, hard-to-find content. One where you can tweak the kind of content you're looking for, or even switch between different modes: am I just looking for the definition of a common but complex term, am I looking for a specific article that I vaguely remember a phrase from, do I want something I've seen before, or am I in the mood to discover new, unexpected things?

Also, I just don't want to see results from some sites. Let me tweak the importance of some sites, rather than relying on Google's gameable algorithms.
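The per-site weighting asked for here can at least be sketched as a post-processing step over any engine's results. A minimal Python illustration (the domains, weights, and relevance scores below are all made up):

```python
# Re-rank search results with user-chosen per-domain weights.
# Weight < 1 demotes a site; weight > 1 boosts it; unlisted sites
# keep their original score. All values here are invented examples.
site_weight = {"pinterest.example": 0.1, "obscureblog.example": 3.0}

results = [
    ("pinterest.example/pin/123", 0.9),
    ("obscureblog.example/mint-care", 0.5),
    ("bigsite.example/article", 0.7),
]

def rerank(results, weights, default=1.0):
    def adjusted(item):
        url, score = item
        domain = url.split("/")[0]
        return score * weights.get(domain, default)
    return sorted(results, key=adjusted, reverse=True)

ordered = rerank(results, site_weight)
```

The down-weighted aggregator sinks to the bottom and the boosted obscure blog rises to the top, without touching the underlying engine at all; the hard part, of course, is getting honest raw scores in the first place.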


To me it seems the problem with Google goes far deeper than struggling with bad SEO.

- For years it has been next to impossible to get a result that is faithful to the search you actually typed in. This does not depend on SEO spammers at all, only on Google's unwillingness to accept that not every user is equal and some of us mean exactly what we write, especially when we take the time to enclose our queries in double quotes and set the "verbatim" option.

- Ad targeting has been so bad it's ridiculous. Yes, on average it works, but around the edges it is somewhere between tragic and hilarious. For ten years after I met my wife, the most relevant ads Google could think of were dating sites. Not toys, not family holidays, not tech conferences, not magazine subscriptions, not offers from local shops, but dating sites so scammy that I cannot imagine how most people would fall for them. (For a while I wondered if this was a fluke, but I have since confirmed it happens to others in my situation as well.)

- It is becoming ridiculous in other areas too. For example: what is the idea behind aggressively showing me captchas while I'm logged in with two different Google-controlled accounts, one Gmail and one G Suite, both paid?


> "For years it has been next to impossible to get a result that is faithful to the search you actually typed in."

Good lord, yes. If I type two words, I want preference for sites that contain both of them, yet the first results all have either one or the other, because surely I must be more interested in a popular site that uses only one of these, right? Google is sometimes too smart, trying to interpret exact words I type as vaguely related words. Sometimes that's relevant, but often it's not.

> "For ten years after I met my wife the most relevant ads Google could think of was dating sites. Not toys, not family holidays,"

They have a tendency to show you ads for exactly the thing you don't need anymore because you already found it. I don't think AI is in any danger of taking over the world just yet. Except with bad advertising, apparently.


The "AI" that Google's search engine seems to have is definitely feeling more "human" over time, but not in a good way --- it's like a stupid salesperson who has trouble understanding what you're trying to find. An analogy I have is that you go into a pet shop and ask for a black cat, and instead the salesperson shows you black dogs, white cats, and green gerbils (because they're absolutely cool these days and you wouldn't want to miss out on a great deal, no?)


They should have a checkbox to disable AI. :)

Something as simple as Apache Lucene would beat Google's algorithm any day for relevance.

But I guess Google doesn't care for relevance, they just care to show you ads.


> "They have a tendency to show you ads for exactly the thing you don't need anymore because you already found it. I don't think AI is in any danger of taking over the world just yet."

There's an eschatological trait to targeted advertising, as it seems to be all about past sins. So I'm not too sure about your evaluation and AI's own claims…


> I don't think AI is in any danger of taking over the world just yet.

The scary thing about AI is that, even as the algorithms have greater and greater intelligence, we're still not much closer to teaching them to do what we want them to do. They can game the system better than ever, and then the universe is tiled with surgical masks.


So if AI ever takes over the world and kills us all, it will probably be because it failed to understand what we actually wanted.



If "what we actually want" even comes into consideration, we did orders of magnitude better than the current industry standard. (Poor consolation, I know.) Right now, the vast majority of AIs don't even have a concept of "human desire" – probably none of them, to be honest, though some that are good at manipulating their handlers might've come close to a particularly stupid dog's understanding. This is at the core of the Friendly AI problem: https://wiki.lesswrong.com/wiki/Friendly_artificial_intellig...

Just because we created the AI doesn't mean it'll care about us. That's like saying a maths problem will try to make your favourite number its answer, just because you wrote it. No, you are the one who must make its answer the number you want. It won't happen by chance.

Corollary: you can't patch broken FAI designs. Reinforcement learning (underlying basically all of our best AI) is known to be broken; it'll game the system. Even if they were powerful enough to understand our goals, they simply wouldn't care; they'd care less than a dolphin. https://vkrakovna.wordpress.com/2018/04/02/specification-gam...

And there are far too many people in academia who don't understand this, after years of writing papers on the subject.


Or perhaps because it was rotten to the core, as it was initially created for a malicious purpose (most likely advertising).


Maybe this is how AC finally reversed entropy.

https://www.multivax.com/last_question.html


Insufficient data for an answer.


> They have a tendency to show you ads for exactly the thing you don't need anymore because you already found it

Say you searched for a TV a month ago. Now you're seeing lots of ads about TVs. Stupid Google.

But is it? A substantial fraction of those people are returning their TV because something is wrong with it. Now they are looking for another TV set.

Sure, the majority keeps their TV. But it is still profitable to target all those TV buyers, because they have self-selected into the set of people who really want a TV now, and they are willing to pay.

Reaching the fraction of those who need another one is probably[1] very lucrative.

[1] I'm sure Google has run the numbers


Agreeing: This thread of thought comes up semi-regularly here, I've argued similarly to you.

People will rebuy good products, or be stimulated to replace other similar products (bought a new TV for the kitchen, now the lounge TV seems dated; new boots feel awesome, get another pair for when they wear out; new $thing is fun, buy one for a friend's birthday).

There's also a big place for brand reinforcement. Show Sony stuff, to remind someone ['s subconscious!] they bought Sony.

A tertiary effect is what I call the "Starbucks Purposeful Bad Naming effect" - you get ads for the exact TV you bought -- beyond the brand reinforcement, etc., you also get to tell everyone you meet a weird story about how "internet advertising is broken ..." and "yes, my new Sony TV is great thanks, you should get one".

Those ad agencies aren't stupid; they have metrics for their metrics and have tracking that can tell you to the second when your gut bacteria burps ...


> Those ad agencies aren't stupid; they have metrics for their metrics and have tracking that can tell you to the second when your gut bacteria burps ...

Stupid they aren't, but I don't think they're smart in the way you suggest.

Ad attribution is a hard problem. Or, in other words, it's hard to estimate which $ spent on which advertising activities generated how many $ of profit where. That gap is a huge opportunity to scam the product vendor out of their money.

So the ad agency has a metric for their metric, and their reports overflow with numbers and various charts shaped like food or aquatic mammals. But does that mean anything at all? It might not. Statistics is hard, and as long as the vendor isn't better at it than the agency, money can be made. I used to work next desk to a group of content marketers who had no fucking clue about what their numbers mean, but their customers didn't have a clue either, so they happily paid money in exchange for reports that showed the Facebook campaigns "worked".

Now the advertising industry is large, and by definition filled with companies that aren't paragons of virtue and honesty. These companies specialize, providing building blocks and platforms for each other, and they compete internally. It's not like people building tools for lies and manipulation are suddenly honest when dealing with their in-industry customers and competitors. After all, convincing advertisers that your A/B testing package is worth the money requires... well, advertising.

So my personal view on the industry is that it's mostly self-reinforcing bullshit. Doesn't change the fact that it generates stupid amounts of money, though.


I used to work in Ad Operations (literally buying ad space and running campaigns) and can attest to the accuracy of this.

Clients were clueless: they had their metrics and they looked at them often, but from my interactions, deep understanding of those metrics and the realities behind them was lacking. The chain of technologies was patchwork and would rarely support all the required features from ad-serve back up to agency: click and view attribution was especially flaky and inconsistent. The adserving environment we worked in (in app) often had issues with view attribution, and we'd tell clients that, but we knew for a fact that some of our competitors didn't and clients would always ask us why our view attribution numbers were worse.

Combine that with more suspect behaviour from suppliers and competitors than you can poke a stick at (questionable traffic sources and campaigns that were probably outsourced from under you, suspect and plausibly forged numbers, etc.), and most of the metrics are plausibly poisoned with illegitimate data to a degree that is difficult if not impossible to nail down, which more or less makes lots of those metrics worthless.

> So my personal view on the industry is that it's mostly self-reinforcing bullshit. Doesn't change the fact that it generates stupid amounts of money, though.

Couldn't agree more.


> People will rebuy good products,

The big reason advertisers show the ads for products you already purchased is also to reinforce their brand. If you buy some stupid cable, you won't remember the name of the company that made it, but you will if they show you the ad a couple of times in a row, and you will be likely to buy things there again even if it's not the same product.


Agree, that was in my 3rd paragraph above.


> Say you searched for a TV a month ago. Now you're seeing lots of ads about TVs. Stupid Google.

> But is it?

Yes, because Google knows who started searching for TVs again and who didn't.


That's why I miss pre-Google search engines such as AltaVista and AllTheWeb. If you searched for "some obscure string of words" you would only get results that matched that exact string. I really don't like how Google just chooses to vary the spelling of your query when a match isn't found. I often search for electronic components using their part number. I'll type in something like "P204PPX" (a random code I just made up) and, despite there being no match, Google still gives me pages of results that are nowhere near what I was looking for.

And the worst thing is that this is all done to keep those ad dollars flowing. Look at how many companies always have a paid advert associated with their name when a search is made. They are paranoid about losing rank due to Google fiddling their algorithm or someone else doing a better SEO job using their brand.


Google used to respect search operators, and dramatically tone down query optimization for queries that contained operators. As I've written elsewhere, I suspect learn-to-rank is to blame[0], by optimizing ranking for generic sloppy queries despite your query being very focused.

[0] https://news.ycombinator.com/item?id=22747889


If a match isn’t found with your verbatim query, then it falls back to something similar. At least that’s my experience.

Random string of text still matches exact in quotes: https://www.google.com/search?q=%E2%80%9CI%20really%20don%27...


> - Also in other areas it is becoming ridiculous. For example: what is the idea behind aggressively showing me captchas while I'm logged in with two different google controlled accounts, one gmail and one gsuite, both paid?

To intentionally discourage you from using Firefox so you give in and switch to their stalker browser.


You're not their customer, advertisers are, so it's only natural that the ads you see aren't personalized. That's never been the goal.

It is, however, technically a potential benefit that the more exactly advertisers can target you, the more relevant ads you could be seeing, which is a wonderful sales pitch for users who are agnostic anyway, but that's not how advertising works in practice.


> You're not their customer, advertisers are,

1. I'm well aware of this

2. It doesn't contradict anything I wrote

3. I've read it so many times and seen it misapplied so many times it is getting annoying.

> so it's only natural that the ads you see aren't personalized. That's never been the goal.

I doubt it was the intent of the advertisers to waste expensive impressions on people who weren't in the target audience at all, so I'm pretty sure they expected some personalization WRT which customers gets what ads.

I also very much doubt that it was Google's intention to annoy me to the point where I trash them in public forums; I just don't think they're capable of fixing it anymore, as they are way too busy "being Google", e.g. doing cool stuff while not listening to customers (I was planning to add more here, but this single example seems to summarize it well.)

I recognize I might be a bit more direct than usual here, and you aren't responsible for the first 97 times I've seen this meme here, but as an answer to my question it is not applicable as far as I can see, and generally that meme is just noise here at HN now.

(Anyone who is actually among today's lucky 10,000 WRT the "you're not the customer" meme, feel free to prove me wrong.)


Meme? If hearing this uttered bothers you this much, then maybe complaining about poor relevance in Google ads isn't such a good idea.

I'm sorry that I seem to have offended you.


I'll try to explain:

My post may contain a meme but it was directly relevant to the post above.

Mentioning that I'm not Google's customer is significantly less relevant (I think irrelevant) when it is obvious that it should have been in the actual customers' best interest to avoid spamming me with expensive and utterly irrelevant ads.


A meme, you say. Searching, you're doing it wrong.


>You're not their customer, advertisers are

Then it's high time everyone but advertisers stopped using google search and start using anything but.

If advertisers are their customers, so be it; let them have it.


> Yes, on average it works

Maybe we frequent vastly different websites, but this has absolutely not been true for me, even for companies who are supposed to be experts at using their data. I don't think I've ever seen an ad that has actually been relevant, and I'm not even trying to hide my habits or behaviors.

For example, take Amazon. Their ads all over the web frequently recommend stuff I already bought just a month ago, the very same product. Or the products they recommend are way out of my zone, like women's clothing, while I never purchased women's clothing or anything close to it.

So, I'm not sure how the ad market even goes around, and my friends are describing the same behavior from the ads, even from companies that have my entire shopping history already (like Amazon).


> I'm not sure how the ad market even goes around

The ad market's business isn't delivering right ads to you, it's convincing people paying for those ads to part with their money. It doesn't have to work well, as long as it works a bit, and there isn't any better alternative around.


> > Yes, on average it works

> Maybe we frequent vastly different websites...

I think we agree. What I mean is on average it works for Google, not that it works for us. They still make boatloads of cash.

For all I know the targeting is equally bad for you and me and everyone, and they are just convincing advertisers that it is worth paying for despite this.


Right, that's a good point, that it's working on their side with convincing the advertisers. Thanks for clarifying so I could understand!


There are definitely sites whose results I wish I could ban from my results. I won't visit them, so they are just a waste of space. My short list: Thrillist, Collider, Vulture.

Also related to SEO, I think, is how every cooking recipe seems to be 6 to 8 large images and a bunch of unneeded text, followed by the recipe 8 to 12 screens down. AFAICT it's entirely unrelated to me getting to the recipe and is instead either a pattern for SEO or for ads.


A Pinterest ban would greatly improve google results.


I'm surprised Google's own Search team doesn't get frustrated enough by Pinterest results contaminating their own day-to-day searches that they'd consider ranking Pinterest results lower.


They aren't going to de-optimize their careers in order to optimize search just for themselves. Google used to be somewhat optimized for power users, but I suspect that learn-to-rank is over-optimizing search ranking for the median user.[0]

[0] https://news.ycombinator.com/item?id=22747889


There is a Chrome extension for maintaining your own personal block list: https://chrome.google.com/webstore/detail/ublacklist/pncfbmi...


Given the uniformity of recipe site design I'm starting to wonder if there isn't more going on. Like maybe they are all run by the same company or maybe there is a template that every person that wants to run a recipe site is somehow pushed to use.

I mean literally, search for any recipe, click the first 10 links. Screens and screens of large pictures and superfluous text, with the actual recipe way, way down the page.


That used to be available as a Google Labs feature.


I like playing around with "Million Short" - https://millionshort.com. It's a search engine that lets you logarithmically filter out the top websites. It isn't perfect, of course, but it's a fun way to discover things.


I love the idea and have tried to use their service a number of times over the years. I've never been terribly happy with the results.


That is a fantastic idea.

Of course some of those top sites, especially the blogging and self-hosting platforms, can still contain obscure stuff that might be just what I'm looking for.


I see mainly two problems:

- Affiliate spam from douchebags that provide "reviews" of products just to link back to Amazon. Makes it nearly impossible to find actual reviews of products.

- People who type whole sentences in natural language into Google. Ever since I started using the internet, I've tried to search for keywords, omitting as many unnecessary words as possible. Most people (after, I guess, ~2010?) don't. This worsens the results.


Wow! Now people who know how to type using proper grammar and vocabulary are a problem???


Not OP, but let's say you want to find out the protein content of brussels sprouts.

I would type `protein content brussels sprouts` (without quotes) because I fully understand that the information I'm seeking might be in some tabular form, or phrased in a way I don't anticipate.

Most non-technical people however would type in `What is the protein content of brussels sprouts?` literally.

This leads content creators who see these queries in "keyword analysis tools" to dump SEO-optimized crap into millions of blog posts, with completely irrelevant word soups with countless variations of the question, and the actual information buried deep within that gibberish essay, unreadable by humans, only optimized to drive ad traffic.

Google's optimization for the non-technical use case has lowered the overall search result quality immensely.

There was a time when Google didn't simply ignore some of your search words, or when control characters like +, -, and "" were actually respected (~pre 2010), and the introduction of verbatim mode didn't change much IMHO.


Interesting. Why do you think that content creators wouldn't see `protein content brussels sprouts` in "keyword analysis tools"? Why do you think they wouldn't create millions of blog posts containing words `protein content brussels sprouts` or countless variations thereof?


Because Google optimizes for "quality content" since at least the Panda update. They're using NLP tools to assess the writing quality (similar to algorithms telling you at what school grade level your writing is).

This was good, because it cured all the copy&pastable 2000s era "tag cloud" sites which simply dumped tons of search keywords all over the place.

Ideally, it led to a stronger emphasis on high-quality human-written content, but it turns out that this algorithm, again, is easily fooled by feeding it "SEO essays": text that looks like prose but is irrelevant gibberish, written coherently.

That led content creators to expand data that would ideally be presented in tabular form on one page into multi-page "SEO prose" that looks like it's written for humans but is completely indigestible.

That, along with Google's auto-suggestion feature that finishes your sentences after you type in some words, especially on mobile, led to the impression that people actually like to search in full sentences.
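For the curious, the grade-level scoring mentioned above is usually something along the lines of the Flesch-Kincaid formula. A rough sketch (the syllable counter is a crude vowel-group heuristic, and this is certainly not Google's actual implementation):

```python
import re

def count_syllables(word: str) -> int:
    # Crude heuristic: one syllable per run of consecutive vowels.
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def fk_grade(text: str) -> float:
    """Flesch-Kincaid grade: 0.39*(words/sentences) + 11.8*(syllables/words) - 15.59."""
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * len(words) / sentences + 11.8 * syllables / len(words) - 15.59
```

By a metric like this, a coherently written "SEO essay" scores just as well as genuinely useful prose, which is exactly the loophole described above.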


They do, but the whole-sentence fraction is probably in the majority. Which leads to degraded search results and optimisation in the wrong direction.


One can easily make them a minority by creating a script that issues keyword-only searches to Google and letting it work 24/7/365 from a few hundred machines.


No, you can't solve it that way.


When it comes to search engines: Yes. They are.


Recent example: I wanted to find out why the directions on those Banquet pot pies say to let it stand for five minutes. The words on the box say it finishes cooking in those five minutes. I want to find out more about that: why didn't it finish in the oven? Is it cover for liability, so people don't get burned?

All Google returns is page after page of recipes and posts about Banquet pot pies that have no connection to my input. Google used to be so good at this kind of thing. I know the answer exists somewhere because I found it once a long time ago searching for answers to the same question. Google found it then.


This could perhaps be illustrated well with a hard boiled egg. If you remove an egg at the 7 minute mark, the internal temperature is still around 100°C, and it will continue to cook. To stop it cooking, dunk it in cold water.

Similarly with the pot pie, the filling retains a ton of heat, so it still is cooking for those few minutes, as the heat dissipates. If you left it in the oven five minutes longer (to 'finish') then put it in cold water (like the egg), you would have a pie in a similar, though wetter, condition.


I thought it might be something like that. You're way better than Google. Thanks.


I have another "Google is shit now" anecdote: a few months ago I wanted to look up some trivia for the movie "Lord of War". I entered this term in Google, and since there was a game that was just released, it responded with "Showing you results for 'God of War'". No results on the front page had anything related to the Nic Cage movie.


Perhaps Google believed that nobody would intentionally look for a Nicolas Cage movie...?


Why? He's an academy award winning actor who is hugely popular or else he'd be out of a job.


Lord of War is a pretty decent movie.


Can you find the exact search term from your search history?


It was "lord of war director's commentary". I thought it was because many more people were googling for the game, but I just tried and Google is still doing this!


If you click the “search instead for...” then the non-video result was exactly what I believe you wanted.


The point is, why is it doing this?

It's not even suggesting both God of War and Lord of War, for me all the results on the first page are about God of War.


It's probably correcting people's typos, as well as saving a crap-ton of money this way (search results are undoubtedly cached).


At the cost of destroying their once in-its-own-league quality.

Today I use DDG mostly, thanks to Google's choices. DDG is equally bad WRT this but

- it is easier to move from DDG to Google (just add !g) than the other way around

- and DDG isn't as invasive


I just searched "Lord of War" and the front page is entirely about the movie. Don't know what to tell you.


So you replied to my response about the exact search term I used by telling me... this?


Yes, you said it was happening currently, so I tried it. The problem you're complaining about doesn't actually appear to exist, at least not objectively.


> so I tried it.

It doesn't appear that you did. The search term in question is "lord of war director's commentary," and you say you're trying "lord of war." And even if the problem didn't exist for you, but did for them, that does not mean it doesn't exist "objectively."


You are using a different query it seems.

Also, to me as a non-native speaker, you appear to be rude.


They aren't being rude, just increasing the sample size and reporting back. Perhaps we're witnessing SEO for different regions, regulations, and aggregate history of the two.


Again, I'm not a native speaker, here are the exact words that krapp used:

- "Don't know what to tell you."

- "The problem you're complaining about doesn't actually appear to exist, at least not objectively."

In my limited experience I don't see any reason to use these exact words in this context except to belittle?

Anyone care to explain? My seatbelt is fastened and I'm ready to learn (and apologize if necessary :-).


I'm a native English speaker. Your feeling that there is a belittling connotation is valid.

The first quote is potentially a dismissal, which is belittling. However, if it stood alone, it could also be interpreted as the person backing off because they lack qualification to interpret the results. But the second quote includes dismissal terms like "complaining" and "not objectively" in a demeaning context.

So, with all that, I'd say your impression is valid. The context highly suggests that these responses were meant to belittle the parent author's contribution to the conversation.

Here is my take on the parent's search results. The God Of War director ( Cory Barlog ) was a big part of the marketing for the game. And he did some in-game commentary for it too. So, this suggests some kind of SEO manipulation. But it could also just be the google spellcheck guessing wrong.


Found the point of confusion. “Lord of war” returns the movie. “Lord of War director's commentary” returns the game.


Intentionally? I'll assume good faith. Under that presumption, my best interpretation of that poster is "socially unaware, likely prone to nitpicky argumentation."

The poster who said he was having difficulty getting good results from Google, was obviously venting about his own personal experience.

Next comes along another poster who says "I tried it. I don't have your problem." Another post down, "... The problem you're complaining about doesn't actually appear to exist, at least not objectively."

Excuse my outburst, but who the fuck says that?

The original poster was venting about a problem he experienced. What good can someone do when he comes in and says that he doesn't experience the same problem, and states that it likely doesn't exist? There is no upside here. There's only a downside: being rude.


>Excuse my outburst, but who the fuck says that?

>There is no upside here. There's an only a downside: being rude.

Says the person who apparently created a green account just to shit on my grammar and cast aspersions on me.

Pot, meet kettle.


I'm not trying to be rude, but I honestly believe a lot of the complaints people make about how useless Google's search results seem overblown. I use Google all the time, sometimes for obscure results, often for technical stuff, and the worst I've ever had to do is look past the first page, but often the first page suffices.

Google showing results for director's commentary of God of War when someone searches "Lord of War director's commentary" is arguably not a failure on Google's part if more people do search for the game than the movie, regardless of the incorrect title.

That said, I completely agree with the theses of TFA. SEO is a cancer.


Google usually shows a link to "search for x instead" when that happens. And let's be fair, most people searching for "Lord of War" are probably really searching for "God of War."

Having to click an extra link or maybe scroll beyond the front page doesn't make Google a shit show.


DDG doesn't work for me either when it comes to more general, not tech-ish stuff. One thing for news I found was https://yetigogo.com


Yeah DDG is hit or miss. Most (70% or more) DDG searches work but a lot of times Google or Bing does a better job.


Half your searches? Isn't that a bit dramatic? You can always check your search history, but I seriously doubt that for 50% of your searches Google cannot find anything (assuming the data is accessible somewhere). Tell us a few of the obscure things that you know exist openly and that Google could not find for you.

I see these claims all the time, but usually with zero examples.


It's a very rough estimate. I'm not going to check every search in my history. But it feels like the chances I'll find what I need are comparable to the chances I won't.

One example: this weekend I was looking for lyrics from the British folk band Why?. I know they exist; my brother has a bunch of their albums. I have quoted lyrics at Google, I've searched for it on Youtube, I've searched for the band name combined with song titles or names of band members, and I found tons of other bands and other random crap, but not the band I was looking for. Eventually I searched for a very specific phrase that was also the title of one of their live albums: "Jig at a Why? Gig", and that finally turned up results.

It's an obscure band, and their name being a common word certainly doesn't help, but surely, combined with song titles, lyrics and band members, it should be pretty clear what I'm looking for? But with Google giving strong preference to the most popular results, Google becomes primarily good at finding things you don't need a search engine to find. I want a search engine that's good at finding things that are lost, rather than in plain sight.


Lyrics are notoriously hard to find on Google. I assume it's in part due to copyright issues, but it's also not how Google works. Searches are based on key words, not exact matches of several words in order.


There was once I time where I could type maybe 5 random words from a song and get it as the top hit.

Now there are times where I can’t remember the song title, but I can type a few lines of lyrics verbatim plus include the musician’s name and get only random, unrelated links.


I think this particular query has a problem with the unfortunate name "Why", which is probably causing the confusion. I don't think search engines did a better job before, nor does this have anything to do with SEO. Replace the "Why" with another obscure band with a distinct name and you would get results. Could Google do better? Sure. Is it worse than before? I really don't think so.


This is how most complaints about "Google these days" play out.

Once pressed to give actual examples, we realize it wasn't a trivial search anyways and certainly not something better Google did long ago on a technical basis. And Bing certainly isn't doing much better.

Of course, there are some things that Google does filter out these days like things that seem like pirated content.


Somewhat unrelated to your point but the phrase "Google these days" made me think the following: Could Google freeze its index and capabilities every one or two years and make it available as a sort of "search the web like it was in 2008" archive? That might also solve the problem of how to prioritize search results over time. The people in 2015 are likely to have been interested in different things than the people of 2040, especially for some search terms (vine for example). I mean they already do this by having different search engines for different countries/languages.


    - Error messages
    - Error codes
    - IC part numbers
Those are the top 3 things that routinely yield absolutely no useful results for me. The first two seem to produce pages upon pages of spammy SEO sites (with titles like "fix errors now") which don't even contain the relevant error message or code, and the latter alternates between no results or, once again, pages of SEO spam.

When I was still in an office with coworkers a while back, we had developed the habit of yelling "fuck you Google!" and showing a middle finger at the monitor whenever a search yielded absolutely WTF or useless results, which was a cue to everyone else around to jump in and help. A stronger tirade of profanity was reserved for when someone managed to trip the bot-detector CAPTCHA hellban. At first only the former happened once or twice a week, but shortly before working from home, we were getting Google-screwed multiple times a day, and tripping the hellban so often that most of us switched to a combination of Bing and Yahoo; while still not ideal, and the results weren't much better, at least we weren't routinely getting banned from them for trying harder to find what we were looking for.


Agreed, I almost always find what I am looking for on my first search. It might be a few search results down the page, but usually what I'm looking for is found on the first search.

Maybe I've just adapted to typing in words and phrases into Google in a way that brings up the results I'm looking for.


https://millionshort.com/about is an implementation of the idea of finding obscure content, but it seems not successful to me.


Google is of course going to return the most popular results to please the largest number of people.

If you're looking for more technical results, it'll take a more technical approach.

You should learn search operators - https://support.google.com/websearch/answer/2466433?hl=en - that allow you to control the results for specific sites or very detailed requests.

Even better, set up a custom search engine - https://support.google.com/customsearch/answer/4513882?hl=en - with 100 of your trusted sites and now you have Google search precisely tuned to your needs.
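A lightweight way to approximate that whitelist without creating a custom search engine is to OR together `site:` operators. A minimal sketch in Python (the domains, and the made-up part number reused from upthread, are just placeholder examples):

```python
# Sketch: build a query restricted to a personal whitelist of sites,
# approximating what a Custom Search Engine does. The domains below
# are placeholder examples, not recommendations.
def build_query(terms: str, sites: list[str]) -> str:
    """Join site: operators with OR and append them to the search terms."""
    site_filter = " OR ".join(f"site:{s}" for s in sites)
    return f"{terms} ({site_filter})"

print(build_query('"P204PPX" datasheet', ["mouser.com", "digikey.com"]))
# → "P204PPX" datasheet (site:mouser.com OR site:digikey.com)
```

The resulting string is pasted straight into the search box; no API access needed.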


We should manifest a website that we can both find right now, agree on the search terms and operators used...

... and then meet again, in say, 3 years, where this content is either only findable on tiktokterest or not at all anymore.

Google has not a search problem. It has a chronological and social one.


Even better, don't use google


I wish I would have seen this coming years ago. I would have built a Google Custom Search Engine, and every time I ran into a good website, added it to the whitelist. By now, it would probably be alright.


Insert allegory about the best time to plant a tree being 20 years ago... Or today.


It is something I've started now.


How are you meant to find the most obscure content? If it's obscure, it probably means it is not relevant to your search. How would a search engine like that even work?

Tweaking importance of some sites seems like a nice idea though, but it could also be a bad thing.


I'm very confused by your definition of obscure. If I search for something rare and not covered very well, using a set of precise search terms, AltaVista and company used to give me almost exclusively relevant results. Modern Google will give me Windows helpdesk questions for a query about a Linux driver using lots of quoted terms, at most one of which will appear in the results. Rare results are _exactly_ the case where precise queries give higher quality hits. Substituting a different word will swamp them. Specifying sites to avoid or prefer is an extra signal to help with that.


The smarter Google has become, the less relevant it is.

It was a lot easier finding things in 2004 than today.

That's because it tries to be clever, using synonyms and searching for terms it deems related instead of the actual search terms.


This is correct. DuckDuckGo is finally better.


My experience is the opposite: while there certainly was a time when I felt DDG actually respected my queries [1] that time is now gone. The results I get often have very little to do with what I typed in the search box. I find myself resorting to !g more often than ever before.

[1] I seem to have thought so in last December: https://news.ycombinator.com/item?id=18665232


This was my experience as of a year ago. Nowadays when I do a search on DDG (my default choice) the results are terrible. Then I add !g and the results are even worse. I’ve had so many recent searches for information on technical subjects result in abject failure, leaving me to throw my hands in the air in frustration.

It’s gotten so bad that I’ve installed BasiliskII and SheepShaver [1] just so I can relive the nostalgia of the days when I had my first Macintosh, before I’d even had access to the Internet for the first time. The help system and the documentation for software back then was so much more exhaustive than it is today!

[1] https://www.emaculation.com/doku.php/mac_emulation


After the last bout of anti-google posts on HN about a month ago, I took the comment advice and switched fully to DDG.

My experience was horrible. I don't tend to search 'popular' subjects, only technical ones, and hobbyist stuff. DDG was just plain useless for this, I had to switch back to google after about a week as I was using the !g prefix almost all the time.


> while there certainly was a time when I felt DDG actually respected my queries [1] that time is now gone.

I sometimes wonder if they've hired some ex-googlers lately because yes, this is my experience as well ;-)


If I had one word to characterize the modern web it would be "shallow". SEO and commercialization have led to a world wide web where I can easily find 100 shallow, keyword-optimized articles on "machine learning for IoT" published on high-ranking websites, but not a single page with actual in-depth information about the topic.

But there still is great content on the web, it just becomes harder to find in all the noise. I think websites like HN and Reddit and - to some degree - sites like Twitter with their human-based curation are really important for this, so I'm glad they're thriving.


Early SEO efforts were actually good for the web. They forced you to make your content easier to find and more accessible.

But then that became the table stakes, and people had to start resorting to dirtier tactics. Or just more annoying ones.

During this shelter in place period, I've been reading a lot more recipes online. Every one of them starts with the person's life story. And I get it, maybe how that recipe came to exist is interesting. But at least put a link right at the top that says "skip to recipe" or something. Sometimes I want to read the story, sometimes not. Make it easy for me to skip!

I put a recipe on my own website, it's literally a .txt file with just the recipe, ingredients right at the top. I posted a picture of the final product on the internet recently, and a friend asked for the recipe, so I sent him the link.

He replied "the format and delivery method of this is almost more satisfying than the recipe itself".

That's how I know we've gone too far in SEO.


It's good ol' Goodhart's Law, all over again.


The article linked here has a good point it is trying to make, but makes a number of false points that undercut its goal.

1998 - 2003 was one of the most difficult times to find what you were looking for, even on Google. Many searches for basic information would return results buried in spam pages, pornography, and scams.

Deleting old content to manage "crawl budget" is a myth and does not work or help your SEO.

The real problems are that Google is directing the bulk of traffic to certain brand name websites. Another real problem is that Google set a simplistic AI with a goal of increasing clickthrough from search results and decreasing bounce rates. This leads to developers building all those top 10 lists where you have to click through each item (harder to bounce that way), and some of the pages that disable the back button in various nefarious ways.

I also agree Google should be showing smaller websites more frequently - perhaps optimize for a different goal than the one listed above. More weight on keyword matching perhaps or maybe following only a few "authoritative" users CTR & bounce rate habits.


> Deleting old content to manage "crawl budget" is a myth and does not work or help your SEO.

Typical SEO cargo cult behavior. This worked at one time, or at least it seemed to work, so we'll just keep doin' it.

I can't really blame SEO people, though. As long as Google keeps its algorithms secret, I'm not sure what else they're supposed to do except publish good content and hope for the best - which, in an ideal world, would be good enough, but…


I agree, it's easy to hate on the current system, and I think the ads are getting way too similar to the real results now. But all those websites with a massive list of names/places at the bottom just to rank higher in search: that was the real worst time. You'd search "Handyman in Leeds" and the top results would be for a company that wasn't even in the right location but was big enough to rank highly and had "Handyman in Leeds" in hidden text at the bottom of the page.


> and some of the pages that disable the back button in various nefarious ways.

When a site is acting up like that I just click and hold, and this brings up an extended list of previous pages. I kinda assume everyone knew about that feature; it seems to work in all major browsers.


Wasn't SEO, it was Google's "Suggest over Search" strategy, followed by the completely predictable bastardization of organic results by internal groups.

Biz ops says they can increase revenue for [random Google bs] by ranking Y over Z in the results, so it happens. M&A says Rotten Tomatoes won't give us all their data, their users, and their firstborns, so they won't show up, even on page 2.

This is literally what antitrust was created for. Companies do this naturally when they get too successful, it's on us to remind them who pays the bills.

By us I mean the US govt so we're basically fucked.


>By us I mean the US govt so we're basically fucked.

I'd imagine Mr Trump tweeting: "Google is broken. I am the only one that can fix it."


I’d argue that having the internet limited to a handful of gatekeepers, all of whom are sustained by ad dollars, is probably far more responsible for ruining the internet.

I find it hard to believe that in a world where Google and Facebook’s users were its customers and not its product, it wouldn’t be able to find a way to combat SEO effectively, especially considering how they are basically hoarding the majority of the smartest people in the world.


I worked in SEO for ages, and it's shady as f. You can buy links from anyone: the BBC, the Guardian, the Times... It just costs money. You can ask/force people to take links down (copyright scare, sue, threaten).

Fake blogs, we used to run a bunch; some became so popular they became actual blogs on that subject. We'd get money from competitor SEO companies for links on them. There are tons of niche subjects with no info on the internet. We'd often put it up for SEO purposes. Wikipedia was started by an SEO company. The rumour was that Wales started it as a cheap way to get high-PageRank links to sites he owned.

But the rewards were huge. Get a struggling car insurance site from position 11 in Google to position 2 or 1 and their profits would be 10x. They would show us numbers from each advertising sector: SEO, radio, TV, newspapers, etc. Super interesting. Break that down by age / gender... Very interesting.

No wonder the internet is a shit show. So much money involved and zero regulation.


A MAJOR problem with Google is its assumption that if you search in English you don't care if the top results are American.

I've noticed that on google.co.uk, unless you add 'uk' at the end of your search query you'll always get US sites first [1]. Google clearly lump all English-based queries into the same geographical bucket - they never used to do this.

---

[1] Yes, I am logged in, and Google knows where in the world I am.


It's even worse when you are trying to search in a language that is not English from a region that is English speaking.

I often look for recipes in French or German and it's impossible to find anything. I try to browse to google.de/fr, I try adding site:.fr, but it will still try and give me the most English results. My Google search settings indicate I am happy with German and French results but it seems to have no effect...


I have the reverse problem: Google would happily return me Polish-language results for everything, where I almost never want that. For the types of searches I do, there's almost never anything worthwhile in my native language. This, + their annoying attempt at detecting which country I'm in (so I would get German results when in Germany, and Chinese results when in China) led me one day to figuring out the magic URL incantation to force English results, and replacing the search in omnibar with that.


Do share, please.


I don't have the original anywhere; I used it exclusively on my work laptop two jobs back, which was the only one that moved internationally somewhat frequently (for the rest of my machines, I clicked around Google settings until I got English-language results and that usually stuck). But I think it was https://google.com/search?hl=en&q=%s.


> Yes, I am logged in, and Google knows where in the world I am.

They never seem to know where you are in a way that would actually be useful to you, but always do for their own creepy tracking reasons.


I am not logged in and I always get UK search results even if I search on google.com.


It doesn't matter which Google TLD you use: .co.uk, .ca, or whatever, it will still give you results tailored to you, not to the TLD. I just did a random search on a random country TLD and still got a result at the top for my city.


Just use a different search engine. Right now DDG is the only viable alternative. Just force yourself to use it, regardless of all the edge cases that suck. When DDG becomes as crap as Google, we can use whatever alternative exists at that time to replace it. The same goes for Instagram, it's slowly but surely turned into an ad infested cesspit (Three consecutive ads between user stories? seriously?). This is how the cycle goes I'm afraid.


I think most spam websites today could be filtered out with very simple algorithms. But that would lead to fewer people ending up on these websites and clicking ads. So if your search engine is also an ad network, filtering out spam websites is not in your interest.
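As a toy illustration of the kind of "very simple algorithm" this comment has in mind: a keyword-stuffed page tends to repeat one term far more than natural prose does. The heuristic and both thresholds below are my own invention for illustration, not anything a real search engine is known to use:

```python
from collections import Counter

def looks_stuffed(text, max_top_ratio=0.08, min_unique_ratio=0.3):
    """Toy heuristic: flag pages where a single token dominates the
    text, or where the vocabulary is suspiciously repetitive."""
    words = [w.lower() for w in text.split() if w.isalpha()]
    if len(words) < 20:
        return False  # too short to judge
    counts = Counter(words)
    top_ratio = counts.most_common(1)[0][1] / len(words)   # share of the most common word
    unique_ratio = len(counts) / len(words)                # vocabulary richness
    return top_ratio > max_top_ratio or unique_ratio < min_unique_ratio
```

Run it on the kind of "keep reading to know why SEO ruined content" filler quoted elsewhere in this thread and the dominant-token check fires immediately; ordinary prose passes.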


This is the real problem.

SEO will always be a game of cat and mouse. The original algorithms were designed to surface useful content relevant to the search query with the limitations of the technology at the time (so they could be gamed).

Nowadays technology has improved and processing power is much cheaper so it should be possible to use machine learning to recognise what’s “good” and what’s SEO spam and thus get ahead of the SEO crowd again.

The problem here is that the spam sites are also the ones with ads (often Google ads), so there is no financial incentive for Google to actually do anything about those.


I think SEO mainly affects content which was already very low quality. Recipe blogs are a prime example - this is the worst way to get recipes. It is inferior to books and apps (which are more useful in the kitchen), is poorly indexed, ephemeral (depending on the wordpress knowledge of the owner), lacks local context, and has none of the rigour and pragmatic advice of something like seriouseats. They deliberately use weird ingredients, I find, perhaps to prevent you from falsifying the quality of the recipe and also to differentiate themselves from tried and trusted recipes (which is what most people want!). The instructions you get on the back of the flour packet are superior to recipe blogs; at least they should work for that type of flour.


The article doesn't present complete facts. Regarding the zero-sum game, this perhaps was true in the old PageRank algorithm. But I'd believe Google's ranking algorithm has advanced beyond simple keyword density and passing links. What I've noticed is it now gives much more emphasis to user experience. (With metrics like bounce rate, where the searcher didn't find what they were looking for and went back to the search results.)

We all like to shit on Google but there's no search engine even remotely close to the quality of results. Of course, there's a lot of spam associated with SEO, hacking attempts, spam comments, e.t.c. There are side effects of its algorithm of course, that are negative to web.


How does Google determine bounce rate? That I click on another search result after I click on the first one?


Google search result links don’t link to the site directly but go through a Google-provided redirect that presumably has a reference to the original search query.

If you were to go back to the same search result page and click on another result within a short timeframe they will assume you “bounced”.

They also have Google Analytics littering the majority of the web, so I’m assuming that gives them a signal as well.
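A sketch of the "returned to the results page quickly" signal described above, assuming the engine has a per-query log of click timestamps. The 30-second window is a made-up threshold; how Google actually computes this, if at all, is not public:

```python
BOUNCE_WINDOW_SECONDS = 30  # invented threshold for illustration

def bounces(click_times):
    """Given the timestamps (in seconds) at which one user clicked
    results for a single query, count clicks that were followed by
    another click within the window -- i.e. the user came back to the
    results page quickly. The final click is never counted as a bounce."""
    count = 0
    for earlier, later in zip(click_times, click_times[1:]):
        if later - earlier <= BOUNCE_WINDOW_SECONDS:
            count += 1
    return count
```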


It's more complicated than this.

Google links directly to search results now, they stopped using the tracking redirect years ago. They probably track clicks and scrolling directly on the SERP with JS.

Google Analytics is absolutely not used for ranking purposes. GA is far too unreliable and gameable to be used for anything like that. It's more likely that Chrome and Google Safe Browsing are used for tracking user hits.


But websites began splitting small articles into 20 pages you have to click through instead of scrolling, and they disable the back button.

I'd rather open a new tab from the search results and close it than click the link and go back. I imagine many people do that.


I really dislike posts like this on HN. It’s essentially whining. The author does not suggest any alternative, does not offer any ideas of their own, and just laments the state of the world.

Instead let’s upvote articles on how to build search engines, how search indexing could be improved, how Google’s search works, etc.


The upvotes on this post say something else. Personally, I value the discussion more than the post itself, so as long as there's a healthy discussion the post checks out.


Why does an article have to do both: raise the problem and provide a fix for it? Do you also expect journalists to catch serial killers?


There's a difference between breaking news and lamenting the state of the world. Journalists don't write articles that say nothing but "Serial killers are terrible. I want to live in a world without serial killers."

We can agree to disagree. I personally do not find any value in people complaining without illuminating the problem they're complaining about, and seeking or proposing a different vision or solution.


This article is not just whining, it is detailing the ways that SEO has impacted the internet, many of them not obvious to most people.


So if my car breaks I am not supposed to say it is broken unless I provide detailed instructions on how to fix it and on how a car should be engineered to avoid being broken?


Google with default settings is useless and using more sophisticated queries quickly walls off the user with increasingly annoying captcha.

The "world's knowledge under you fingertips" motto is still valid and brilliant though. My personal solution is library of OCR-ed PDFs with most established books from various domains, git repository for each domain. Greppable in miliseconds, locally. Hijack this, SEO experts!


Your personal solution sounds interesting. I'd be interested to know more details about it, and how it works for you. If it is how I imagine it, it could also be one of those things that could be built into a tool that could rival Wikipedia or search engines.

Based on your reply to handsomechad, you may think that it's easiest for people to just build one themselves. But there may be a business opportunity in providing a pre-packaged solution for the masses. In the same way that Dropbox provided a tool that was "trivial" to experts, but was difficult for non-experts (see the infamous comment here: https://news.ycombinator.com/item?id=9224), if you have a tool that is essentially a rival to Google Books for reference texts, that is interesting.


There's b-ok.org


How did you acquire the books? Are they in the public domain, or did you have to buy them? In either case is there a place to acquire/buy these books massively or did you do it one by one manually?


Whichever most convenient way to obtain a full restriction-free PDF of a book. Fetched one by one through various channels in my case. Very few are in the public domain, if any. BTW one can do the same with academic publications, device manuals, or whatever else content available in PDF.


do you have a link to this solution


ImageMagick and Tesseract for OCR-ing each page of a PDF into a separate text file (through TIFF image format, disregard the huge TIFFs afterwards), private git repos for hosting, then ag/grep for searching. Not as easy to find the phrase back in PDF as with eg. Google Books, but then GB with copyright related content restrictions is useless most of the time.
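For the searching half, here is a hypothetical Python equivalent of the ag/grep step, assuming a layout where each book gets its own directory containing one OCR-ed text file per page (the `page-<n>.txt` naming is an assumption for illustration, not the commenter's actual scheme); keeping one file per page is what lets a hit be traced back to the right page of the PDF:

```python
from pathlib import Path

def search_library(root, phrase):
    """Case-insensitive grep over a library laid out as
    <root>/<book>/page-<n>.txt. Returns (book, page file, line)
    triples so a match can be located in the original PDF."""
    hits = []
    for txt in sorted(Path(root).glob("*/page-*.txt")):
        for line in txt.read_text(errors="ignore").splitlines():
            if phrase.lower() in line.lower():
                hits.append((txt.parent.name, txt.name, line.strip()))
    return hits
```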


I feel like I know enough about the internet that I almost don't need google anymore. (the search algorithm is great, but not the sources it provides)

At this point, I have built a set of "sources I trust" and use google as a tool to internally search their websites more than anything else.

    site:source.com "search query"
Is pretty much how most of my new searches go. If anything, I have stopped trusting 1st page results on google.

1st page guitar tabs are the most vanilla chord strum patterns. 1st page recipes are some "americanized - SAHM blog" version of the real thing. 1st page news is a sensationalized link to CNN or Fox, that doesn't quote 1st sources. 1st page game reviews are IGN and 1st page movie reviews are Rotten Tomatoes. For anything more niche google results wikihow or quora, when reddit almost certainly has a better answer somewhere.

The 1st page of Google search returns perfectly average results. But, I have stopped expecting any 'perfect' or 'great' results from it.
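The site:-scoped habit described above is easy to automate; a minimal sketch that fans one query out across a personal trusted-sources list (the list itself is whatever you happen to trust, these are just examples):

```python
def scoped_queries(query, sources):
    """Turn one search into per-source 'site:' queries, one for each
    trusted domain, mirroring the manual habit described above."""
    return ['site:{} "{}"'.format(s, query) for s in sources]

trusted = ["seriouseats.com", "reddit.com"]  # example list only
```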


Is there anything I can do as a user of the web about this? Is there a search engine out there that is better for these things than Google?


To a certain extent, any search engine will be "better" for avoiding some SEO tricks than Google since somewhere between most to all SEO people are only concerned with how things rank in Google, thanks to its overwhelming majority usage on the English internet - where their pages rank in the results of Bing or Yandex isn't a matter of concern to them. Granted, many of the tricks to satisfy Google are going to satisfy other engines too, but not all of them, and not in exactly the same ways or degrees. (Getting a different result page from Google and from search engine X can be a feature, not a bug.)

I personally have been using DuckDuckGo for several years at this point and am quite satisfied with their results and their commitment to privacy. You shouldn't use any Google product if you have any serious measure of concern for your privacy (he hypocritically types while a YouTube video plays in the background - hey, at least I'm not logged in).


Well, there are other search engines of course, and sometimes they're slightly better. Not great, though.

I'm really itching to start writing my own search engine now.


This is pertinent and don't think the easy answer is yes, as soon as any other search engine gets big enough the same gaming will occur.


The situation has gotten so dismal that I often have to look past page 1, and sometimes page 2 of search results.

Especially when looking for technical things or reviews on practically anything, the top 5 results are garbage sites (with content written or modified by people with names suggesting an SEO "content" factory in a particular region of the world).


Recently, on page 2 and later, and sometimes even on page 1, I see lots of *.it links that just dump a wall of text that seems to have been scraped from legit resources, with all the tags' content smashed into one text block. Example pages that now return 404:

http:// axlk.bebanni50.it/debian-i915.html

http:// bpnq.circoloambientalepiemonte.it/nfsv4-uid-mapping.html

IOW, if it's not on page 1 it's mostly crap (spam).


> The situation has gotten so dismal that I often have to look past page 1, and sometimes page 2 of search results.

That's hardly dismal. That's not even mildly inconvenient.


The r/juststart crown strikes again.


> I remember when it was easy to find logic, facts, and reason on the web. Then, someone optimized it.

When was that, exactly? In 1998, when only 3% of the world had access to the web and creating content was limited to a small handful of privileged individuals? This author rails against "Directing the narrative" while building a site with a thin sliver of links related to a handful of topics they deem worthy of inclusion. They offer no solutions for an internet that serves 3.5 billion people, choosing instead to whine about how much better the internet was "back in the day."

I don't think Google is blameless, but I think they are more of an inevitable byproduct of this many people coming online than they are a root cause.


SEO combined with Amazon (and other) referral revenue opportunities. Combined with human psychology and ignorance outside of the tech community of the wiles of online marketing. Honestly Hacker News is one of the few sources I trust nowadays, and that is of course not implicit. There is a lot of misinformation and a lot of reporting about reporting. A lot of hyperbolic and misleading headlines. Stay safe out there.


Google has also lost much of its memory. Searches that many years ago turned up results from the early '90s now return nothing.


This is the result of Google trying to compete with Facebook. The search results page is their answer to the news feed.


For many of my searches, I simply type "search term xyz Reddit" to find more relevant results than what Google would throw at me.


Anybody else remember when you could search for a phone number and find legitimate web pages that contained the phone number?

We lost that a long time ago.


I just tried this out, and it works fine for at least one of the local organizations here, both in Google and DuckDuckGo (530-926-4698).

I just tried it with my own number, and my site comes up with both search engines as well.


If I do that, most of the time the first few hundred results are pages which just list all the phone numbers there are, consecutively.

Never understood why there are so many of these pages, or why you would build one...


Ad revenue.


Well yeah, maybe in former times, when ads were paid per view. I don't know anybody who is still doing this. Nowadays (for 10+ years already) ads are only paid per click, because of "conversion".


Content today is written for machines, not for humans.

There is no bigger turnoff than coming across waves and waves of listicles and "alternatives to" articles that provide no real insight, beyond dumping a bunch of links, adding 1,000 words of nothing, titling it "Ultimate Beginner's Guide to X in 2020" and calling it a day.

Sometimes, to find "actual person" content, I'll add "reddit" to the end of my search query, but that won't be enough, as some enterprising content marketer has decided that they need to rank for those searches too, and created posts like "What reddit thinks about X".

SEO is the symptom, not the problem. The problem is that businesses create this kind of low-quality "linkbait" as "inbound marketing", which is getting eyeballs on a page (by whatever means necessary) and upselling their own services. That's why the content feels so soulless.


> that won't be enough, as some enterprising content marketer has decided that they need to rank for those searches too

Would it help your goals to use "site:"?


One might add social media to that list as well. If there's a Dunbar's number in real life, there definitely is one for online communities too. https://en.wikipedia.org/wiki/Dunbar's_number


This article is good, but to go a bit deeper, it seems like part of the problem is ultimately capitalism or the commercial nature of the modern internet. As long as there are these incredible incentives to game the system, people are going to do so. And those same incentives apply to Google, since they are advertising driven too. They used to fight more against this stuff, but I think as they've realized they get their cut either way, they're less inclined to do so. And even if they were so inclined, it's sort of asking them hold back the ocean in my view, because the incentives are so stacked against them.


I think Google is to blame for allowing this.


Not just "allowing", but directly causing. Google were the ones who first monetized links by treating them as a search signal.

As soon as links became a signal to search engines, they stopped being an organic expression of page authors.

What's that old saying about metrics? "You get what you measure."


As I see it, maybe 1% of the web is not junk.

I think we need an alternative search engine based on community vetting.

I.e. you submit your URL to the search engine, but it isn't shown in all search results until a certain number of real people say it's meaningful and useful. You can do that by showing it in only 1% of search results at first and asking people to rate it.

You don't have to index 10 billion websites, you just have to index 1 million of useful websites.
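A rough sketch of the 1%-exposure vetting idea, where `TRIAL_RATE` and `VOTES_NEEDED` are invented parameters and the voting model is deliberately naive (a real system would need to defend against ballot stuffing):

```python
import random

TRIAL_RATE = 0.01    # show an unvetted candidate in ~1% of searches
VOTES_NEEDED = 50    # made-up graduation threshold

def visible_results(vetted, trial_pool, rng=random):
    """Results for one query: all vetted sites, plus -- in roughly 1%
    of searches -- one candidate site that is still being rated."""
    results = list(vetted)
    if trial_pool and rng.random() < TRIAL_RATE:
        results.insert(0, rng.choice(trial_pool))
    return results

def record_vote(site, votes, threshold=VOTES_NEEDED):
    """Count one 'meaningful and useful' rating; returns True once the
    site has enough votes to enter the main index."""
    votes[site] = votes.get(site, 0) + 1
    return votes[site] >= threshold
```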


Agreed. If you'd like, you can read an article where I propose how such a system could be implemented: https://medium.com/@TautviIas/it-is-time-to-create-a-decentr...


I wonder if Medium would get included in the index.


SEO effectively means catering to whatever metrics Google happens to be focusing on at the moment. It's supposed to reward "good" content, but there's really no way of automatically judging what's "good" content so Google relies on all these other methods that are open to abuse.

Whatever way Google rates websites has a direct effect on the web itself. In a way they're a victim of their own success.


I wonder if some real competition in search would help things. It might be harder to create SEO that “fools” a handful of different search engines than just creating quality content. I’m not sure how we’ve come to accept a near total monopoly on search so willingly.


Or rather we are the victims of their success.


Partially true because of how pervasive they've become, they're powerful enough to influence the lawmakers in my country. I don't really see myself as a victim though, I just use different search engines.


It would be good to have a search engine algorithm that searches visible content only and places anything with ads and trackers last.


That would require the search engine itself to not be funded by those same ads.


I hate SEO. Google's ranking algorithm is basically an accretion of scar tissue built from years of SEO bullshit mitigations. If you make a ranking algorithm that does right by the user, and you're successful, it won't be long before SEOs come along and poison the well.


> erasing the past

Wouldn't it make sense to archive the old articles and put a no-crawl rule on them? Deleting just seems extreme no matter how you look at it. That being said, content that is controlled by such people is probably not worth keeping.


So, someone pays a lot of money hiring people to farm content repeating the same meaningless expressions again and again, to get some visitors from Google so they can show them some ads and make money.

Wouldn't it be more productive to hire people to write meaningful and interesting content? That way they wouldn't just have visitors tricked to come from Google but also a constant following.

The only reason I see for junk content is lack of imagination.


Maybe it's time for a search engine developed and run by the community in the same way Wikipedia is.


Just look up a recipe for something and this fact instantly becomes apparent


That's exactly why you're seeing this new generation of startups building high-fidelity content through their own work and crowd-sourcing (like AskFinny for personal finance): you can't trust Google any more; it's full of affiliate-led promotions.


Ironically, the solution is old Yahoo, back in the web directory days.


I'm actually really glad to have seen this and the comments. This has something I've been prattling on about to anyone who will listen for the last couple of years now. Glad to be somewhat vindicated!


I think good content is more than shitty HTML code.


Not only was this entirely predictable, it was actually predicted.



