ChatGPT won’t replace search engines any time soon (algolia.com)
269 points by freediver on Jan 7, 2023 | 409 comments



For me, Google Search just doesn't seem to be getting better over the years; if anything it's getting worse. I honestly feel like it's hard to get what I want half the time with all the SEO spam; most of the time I have to input "inurl:reddit.com" just to get good results.

At the same time, ChatGPT has frequently impressed me, not with everything (my expectations are reasonably low) but it has performed amazing work for me (typing out form letters, code language conversions).

For what it's worth, I wouldn't use ChatGPT for search the way I use Google, but what it has done is taken away time I would otherwise spend Googling things like "how to write X form letter". I expect that as it matures, it will take over more of my Googling.

All these takes underestimate the following:

1) How quickly ChatGPT and its ilk will advance to solve relatively low hanging fruit like "ChatGPT is wrong about this one thing". The delta is extremely important here.

2) How slowly the Google bureaucracy will grind when releasing anything remotely like ChatGPT. All the committees and the burdensome processes in place in Google will keep this new technology locked up for years, and ensure that the final result is a camel (horse designed by committee). It doesn't matter if they have superior technology if they never use it or release it.

3) How much Search means to Google means they will treat any product changes to it extremely carefully, while Microsoft will be willing to experiment with Bing as they have with GitHub and Copilot.

Personally, I wouldn't go long on search engines that don't have a strong ML component to them in the future.


I know this thread is about ChatGPT, but if you're still using Google then I suggest giving Kagi a try. It's a paid search engine, but I've been using it for quite a while now and really enjoy it. With Kagi you can rank and downrank domains as well as block them. There's also this thing called "lenses", which are basically filters for specific kinds of searches. For instance, I have a "Programming" lens and an "Academic Research" lens.

> Personally, I wouldn't go long on search engines that don't have a strong ML component to them in the future.

What's kind of ironic about this is I think search engines may have mistakenly moved away from strong ML in the sense that you're thinking of.

Yes, ML is being used for recommendations much more than ever, but in terms of heuristically finding pages with the keywords you entered, mainstream search engines have become significantly worse at it. I remember a time when The Google would find any pages with the keywords you entered. In recent years (before I stopped using it), I noticed an increasing number of times where I knew it had a page indexed but it would refuse to include it in the results for whatever reason. Either its ability to fuzzy search pages seemed diminished or it would just not match something word-for-word. I could sometimes figure this out when the page I was looking for previously accidentally came up in the results for another barely-related search, so I knew it wasn't that the search engine was culling old pages. Though I'm sure they're doing that as well where they think they can get away with it.

Recommendations and curation are largely overrated, and that's where a lot of ML has been mistakenly applied. Well, I say mistakenly in the sense that it doesn't benefit the individual or society. Recommendation engines do serve the purpose of the company selling those recommendations.

A true application of machine learning to answer engines is the future and will be a big problem for companies that fought the advertising wars by banking on recommendation engines. That is unless they turn their ship soon enough.


> With Kagi you can rank and downrank domains as well as block them.

This is a killer feature and I don't understand why ddg and Google don't do it. Google doesn't even have to respect that list for ads. Just give me a way to remove all results from domain X, Y and Z. There are already extensions which do that, but I can't use them on my mobile. It would improve my Google satisfaction massively since it's normally the same blogspam that I run into.


Blocking domains used to be built in to Google Search.


This is something that could very easily be done with a meta search engine or alternative frontend like whoogle (assuming it isn't already implemented, could be)


Kagi's 'sourced' LLM is available for free currently: https://labs.kagi.com/ai/contextai


Thanks for mentioning it. Interestingly, I recently tested several systems with identical questions (https://www.perplexity.ai/ and https://www.phind.com, formerly beta.sayhello.so). The Kagi beta is sometimes on par, sometimes much better. Try [ what guests who are not actors were interviewed by smartless podcast ]. Most give a mix of actors and non-actors, but both Kagi's Web+AI and AI sections provide correct answers.


This is fun. I tried this prompt: "Should I become a paying customer of Kagi?"

I like how it links references to support the arguments. It even gives cons and one of the sources is a HN thread from 2016! [1] It's not there yet though, because that one was about the now defunct online store platform and payment processor.

On the other hand, maybe it's my fault. I didn't specify that I meant Kagi the search engine. But it's promising.

1 - https://news.ycombinator.com/item?id=12200972


Note that Brave Search also has "lenses" / filters, and it has them for free. https://search.brave.com/goggles

There's a copycats removal goggle and a HackerNews-top1000-sites goggle. I use them from time to time, but I wish I could automatically include the filter with all searches by default (or maybe there's a way and I don't know how).


"I suggest giving Kagi a try. It's a paid search engine"

I don't like the idea of linking my search queries with my credit card.


I can only assume that if you don't think Google knows exactly who you are, you must have much better internet hygiene than most people do.


I haven't used any Google product in over 6 years.


You posted multiple YouTube links here 1 month ago. A link to a Google search 9 months ago. And that's just the stuff you felt the need to post on Hacker News. Clearly you use Google products.


I don't have to be logged in to watch YouTube (I use FreeTube). In the end... yeah, I use a Google product every day, because of DNS.

My point was that I refuse to use a search engine that is tied to my credit card.


Yeah. I don't mind paying, but no amount of assurance is enough to convince me to put them in a position to easily link my web activity with my credit card, and hence my real-life identity. Just let me pay with Monero or something.


+1 for Kagi: I have been using it for a year now. I still use Google on my phone (Kagi's own browser, Orion, is still a bit buggy), so I can compare daily. Classic search results are comparable; with lenses they are even better, plus Kagi has no ads. Highlights have also dramatically improved over the past year. I hope they find a way to filter out the noise from ChatGPT-generated content.


+1 for Kagi. The lenses and customization are nice, but I switched just for the raw query speed. It’s the snappiest service I use daily.


Really? I'd +1 for Kagi too. But I have 2 problems with it. The first is the load speed. Sometimes it just takes so long that I repeat my search on Google, especially on phones. By long I mean a few seconds, so it's not that bad, just long enough that I lose patience.

I'm not sure if it's issues with my connection since I rarely go to Google nowadays. But I never felt the same problem with Google.

In case you're curious, the second issue I have is with private browsing. The session is not carried over, so I'm not logged in. I keep forgetting this and keep having to manually open Google and retype my query. I guess it's not technically Kagi's fault, but still.


Kagi has a private session token feature that you can use to carry on searching in a private window (there's also a browser extension that does this automatically).


But you still have to be logged in with your account, which is linked to your credit card.


If you used a prepaid card in a different jurisdiction (and that didn’t verify names), anyone coming after you would need warrants from two countries or to have breached two systems.


So, to be clear, private mode in the browser won't change anything.


Back when Kagi was free by invitation, I tried about 10 or 20 queries on both Google and Kagi, and Kagi was equal or worse in all of them. It lacked crucial features like showing how old each result was. YMMV.


> I suggest giving Kagi a try

Does not have a sound business model. Let's not waste time on something that is not viable.


Are you implying that advertising is the only viable business model? I really hope you're wrong, if so. Ads have corrupted and made worse everything they've touched online. I would really like to see a new generation of services that aren't ad-based.

Personally it seems weird to me that people assume things like search and email must be completely "free with ads," while nobody expects anything in the offline world to be free with ads. Even TV, if we're being honest, since while broadcast technically exists, it seems the vast majority of people who watch it pay for cable. Why couldn't ad-free gmail and search be a $20 addon to your internet plan? Most people couldn't function normally with NO search engine today, so what's wrong with allocating the kind of money to it that would buy 2-3 cups of coffee?


> Why couldn't ad-free gmail and search be a $20 addon to your internet plan?

A lot of ISPs include email, and it is usually terrible.


Sure, but it’s not because of the lack of ads, but rather because they know virtually nobody uses it. I was imagining paid Gmail, but Internet “with Gmail(tm)” being a bundle they would advertise, like when DirecTV offered TiVo. Or when you can buy HBO through Amazon Prime.


But nobody uses it because it isn't as good as gmail. Before Gmail, it was a lot more common to use your ISP's email. And given the current monopoly power of ISPs I'm pretty doubtful that any such bundling would be in users' best interests.


> Are you implying that advertising is the only viable business model?

No. That does not mean that Kagi's current business model is viable either. Strawman much?


So what?

For now it's clearly better, and not very expensive.

If it fails, I'll find something else, until then why not use it?


Care to expand on that?


I will. As a Kagi user I loved the quality and authenticity of search results but couldn't justify paying $10 / month for it. In my view that is far too expensive for something I can get for free from Google/DDG with concessions.


Fundamentally there are only two business models available for search.

Since searching the vastness of the web in under 500ms is not free, either the user pays for that, or a 3rd party (usually advertisers) pays on the user's behalf. We (Kagi) thought that for something as intimate as search the latter made no sense, hence the birth of the paid-search business model, where incentives between the user and the search engine are aligned.

The price not being right for you currently is another matter, and hopefully one day it will be (you could help by sharing feedback on how to improve the product, and there is new Kagi pricing coming soon).

I personally pay for YouTube Premium ($15/month I believe?) just to not have to see ads on any device I watch YouTube on. Many people would never consider that, but many (~25 million subscribers [1]) still do, despite being able to watch videos for free, the availability of adblockers, and whatnot. So YouTube Premium makes half a billion dollars every month, and that is essentially the same business model as Kagi's.

[1] https://www.statista.com/statistics/1261865/youtube-premium-...


If allowed, how does kagi perform with adult searches?


I paid for it before even using it. So I disagree that your not wanting to pay justifies claiming it's a bad business model.


They license search results from other search engines, and those search engines can increase their fees at any time to suffocate Kagi if needed. It's not sustainable.


There is an objective risk in building on top of somebody else's platform. However, that does not mean that the business model is not sound, but that perhaps the execution may be limited in terms of how big it can get. (Kagi's aspiration was never to be a Google killer [1].)

Companies are built on top of other platforms all the time. TikTok is building on top of iOS and Android. Zynga made its first $1bn building on top of the Facebook platform. Honey, a Chrome extension, was acquired for $4bn. Those are all businesses built on top of somebody else's platform.

In terms of Google's motivation to suffocate it, even if Kagi had 10 million customers, it would be a drop in the sea for Google. And Kagi's very existence helps Google with monopoly issues, so it is hard to see why Google would want to openly suffocate it. Even if it did, there are plenty of other search indexes out there (Bing, Yandex, Mojeek...) that Kagi can source. What users love about Kagi is not just the quality of search but innovative search features that are independent of the results.

What matters at the end of the day is that Kagi is already serving thousands of paying customers, they love the product, and if anything that is validation that the business model is working.

[1] https://kagi.com/faq#Google-killer


> Personally, I wouldn't go long on search engines that don't have a strong ML component to them in the future.

Arguably, it's the ML that made Google useless for some people. For some time now, Google seems to have been curating its results to address searches phrased as questions. In the past we were searching for occurrences of our keywords in webpages, but today Google seems to be trying to be an answer machine. Unfortunately it's not very good at it and is just as inaccurate as ChatGPT.


>In the past we were searching for occurrences of our keywords in webpages

I mean that died decades ago when spammers just made pages with your word repeated over and over again. Spam makes everything worse.


Completely agree. I wish Google were able to fix spam instead of trying to be something other than a search engine.

The cynic in me thinks that Google is doing it because it's more profitable. If the results are crap, maybe the ads look like better content by comparison? It's not like you are going to use Bing.


Google made terrible mistakes with their main cash cow of search.

They removed the important feature to search only forums, i.e. human-generated content, and promoted SEO spam to the top instead. Public forums became undiscoverable and people moved to the walled gardens of Facebook and the like instead.

Then Google killed search by trying to make it into some AI answering robot. Now they ignore what you even ask and just return what they think you'd want.

All that people were asking for was a better search engine and all we got was an inferior version of a chat bot.


I don't even understand why Google doesn't allow blocking some sites in search results. Paid ads, I understand; those generate profits. (Although Facebook still allows me to remove some ads that I don't want to see.) But unpaid SEO rubbish, how does it benefit Google at all? If anything these parasites bog down the quality of search results.


They had a feature to block domains from search results in the past (like, 10 years ago). It was removed. I don't know why, but it feels like exactly the kind of feature that sounds great on paper but doesn't actually survive contact with real users.

First, I'd bet that very few people are actually interested in doing that kind of manual curation or engaging with power user features. How large a % of users need to interact with this for the feature to be worth maintaining (in all the backends and frontends)? How many of them actually do so?

Second, the task of blocking spam is adversarial and sisyphean. Trying to deal with web spam by domain blocking (with an individual blocklist) would be like trying to deal with email spam with your own blocklist of spam words. The results will be worse than whatever can be done centrally, where much more information is available both on the sites and on how users actually interact with those domains. And even if you managed to make a good blocklist for a point in time, your job is not done. Tens of thousands of new domains will have popped out next week.

(The dream here of course would be to use the block decisions from individual users to drive the centralized protections. But unless legit users are actually using this in very significant numbers, it'll quickly become just another abuse surface. E.g. brigading, "downrank your competitor in the results" as a service, etc.)

Third, some people will probably block domains they shouldn't have blocked, and then have a bad user experience in future searches as the sites with the genuinely best results are blocked. And then you're left with only bad options: ignoring the users' stated preferences, which they'll hate, or serving bad results that they'll also hate.

Can the feature work for a different search engine? Sure. For example, what if you have a paid search engine used only by power users and are looking for a simple-to-explain feature that people think they want, to entice them to sign up? It'll be great for that. And if your entire user base actually loves and uses the feature? Well, it becomes a feature worth maintaining and expanding; it'll actually be a high-quality ranking signal rather than something that's trivially gameable; etc.


I'm not trying to block domains from my search results because I'm afraid of spam. Google does a pretty good job of not putting spam in my search results.

I'm trying to block particular domains because I know the websites hosted on them are utter garbage, and better alternatives containing the same information exist.


Like Pinterest. I don't ever want to see it in my search results.


Why does that site exist? It won't even let me see what is on there without a login. How can it possibly make money?


You can use uBlacklist[1] and subscribe to custom made blacklists[2] for specific content.

1. https://github.com/iorate/ublacklist 2. https://github.com/rjaus/awesome-ublacklist


It is hard to filter out SEO rubbish with a low rate of false positives. Spammers became really good at pretending to be real sites.

For mail spam, various trust-based solutions like server blacklists, domain verification, etc. were important to solving the problem. But Google has little incentive to push for trust-based search due to its business model.


> I mean that died decades ago when spammers just made pages with your word repeated over and over again. Spam makes everything worse.

No, there was a long, blessed period between the solving of spam and the introduction of altered results.


Only for the most simplistic implementations. More advanced implementations, incorporating page reputation, took care of that pretty well for many years. Google's quality decline started some time after they acquired DoubleClick in 2008 (notice how they haven't done much innovative since?) and started allowing abusive sites like Quora or Pinterest to bypass policies against things like search cloaking, presumably due to things like large AdWords purchases. All of the scam domains I see outranking legitimate results have tons and tons of ad impressions for Google.


I agree, Google has gotten worse; I find more and more often lately that I do not find what I am searching for. I try to change my question/wording and it just gets worse. I think their algorithm is failing. Did they lose their way? It used to supply useful information. I think they may need to reduce their reliance on website performance. Site performance does not equate to good information. They are excluding the best information just because a site loads slightly slower. Sure, it's nice having a site load fast, but that's not what I'm searching for. I want the best result, slow or not. As a web developer I have had to spend a great deal of time optimizing sites, and clients feel it's extremely important for SEO to have the highest Google PageSpeed score possible.


They probably didn't lose their way so much as they slowly became less effective at combatting the exploits of black-hat SEO and the massive amount of complete garbage content farms out there, while simultaneously allowing the ad business to corrupt them completely, which disincentivizes them from showing good organic results because they sell more ads when it's hard to find the obvious things people want.

Your hypothesis on performance is interesting! I think someone altruistic thought "if we judge sites on this metric, everyone will be forced to do better" but they failed to realize that it's a hard problem at scale to NOT be a bloated mess (especially on the frontend! 'Let's add one more UI library, it's just 12MB!')


The whole site speed thing is purely a cost reduction on Google's part IMO. By convincing site owners that being fast is a ranking factor they can save millions in bandwidth costs as more and more people lighten their page load.

I have watched many web developers absolutely lose their minds trying to squeeze fractions of a second from their load time when there are far better ways to be using their time.


I’ve been going to ChatGPT first for most questions these days and then using google to verify. It’s easier to verify than find an answer from scratch. And often ChatGPT will rephrase my words to use the standard terminology for the topic which makes searching even easier.

To complete the cycle, ChatGPT just needs to be hooked up to verified sources on topics so it can show me exactly where what it says is proven true.


Agreed. When searching on Google I can at least decide which website I am going to trust, as I can directly see the website URL and the information there, but with ChatGPT we don't have that advantage yet.


Same feeling. The other day I spent far too much time trying to find a chart comparing Messi against Ronaldinho. I mean, two of the greatest players ever; how is it possible that I couldn't find a good page? Just tons of SEO garbage pages.


Is this a good chart? First result for “Messi vs. Ronaldinho stats”

https://fbref.com/en/stathead/player_comparison.cgi?show_for...

Searching without "stats" turned up a lot of other stuff, but this was near the top. And searching without "stats" shows broader potential interests.


I did check that table; the data is great, but the readability not so much (especially on mobile), and the meaning of some column names was not clear (tooltips didn't work on mobile).

That's why I was looking for a chart (in the graphical sense), it amazed me that I didn't find one.


Hmm. Is it possible no one has made a good one? I searched for an infographic and came up dry. I usually have no trouble in domains I know well.


> "inurl:reddit.com"

site:reddit.com would be slightly better :)


I was wondering about that. Maybe they know something we don't by using inurl. To catch subdomains, maybe?


Freakonomics recently did an episode on their podcast trying to answer this very question:

-Is Google Getting Worse? https://freakonomics.com/podcast/is-google-getting-worse/

To their credit, Google actually participated in the episode.


> most of the time I have to input "inurl:reddit.com"

"site:reddit.com" should only include hostnames ending with "reddit.com", not URLs including "reddit.com" anywhere. Not sure if this still works though - haven't used Google Search for a few years now and I'm reading (even outside HN[1]) that it's getting worse.

[1] http://web.archive.org/web/20230107072330/https://www.ft.com...


What do you use instead of Google?


A search engine I'm decreasingly fond of, so I won't advertise it.

I've tested alternatives mentioned in this thread and have no clear winner to point at.


Google is completely useless for any long tail query. All the results are so heavily optimized for broad, generic queries that you have no choice but to append Reddit.com or Stackoverflow.com to get specific, targeted answers.


> For me, Google Search just doesn't seem to be getting better over the years, if anything its getting worse. I honestly feel like its hard to get what I want half the time with all the SEO spam...

Agreed, it's horrible and annoying. How are others doing improved searches?



When it comes to "ChatGPT is wrong about this one thing", a miss is as good as a mile.

The book Gödel, Escher, Bach has a set of dialogues between Achilles and the Tortoise over a record player that is an allegory for Gödel's theorem and other limits of computation. Hofstadter points out that it doesn't matter if you are doing computing with neural networks, tinker-toys, lasers, whatever.

The concept of "the truth" is problematic in many ways, in that one can make statements like "This statement is not truthful", call a social media site "Truth Social", put a label like "The Truth is Out There" on the intro of The X-Files, etc. Being able to talk about "the truth" probably erodes our ability to know the truth.

Thus "the truth" is not something that comes in a can that you can paint onto a model, trying to close what looks like a little gap (in some ways it is a little gap) is like pushing a bubble around underneath a carpet.


An interesting point, then, is that we don't specifically need Google's search engine. Any search engine will do.


> if anything its getting worse.

It's gotten a lot worse and will probably get replaced by GPT-4 or its successor.


Your first point (about both the crappiness of search, and Reddit) reminds me of this earlier post: https://dkb.io/post/google-search-is-dying


I am finding that for code searches Google does well, but I use DDG for everything else.


Google is an advertising company, not a search company. And ChatGPT, as soon as it gets into the search engine business, will be one too. That is the fate of any tool that is free to use.


The notion that every information service should be free with ads is one of the main poisons that have made the cesspool that is the net today. Imagine how different the incentives would be if Facebook and Google just made money by making their endusers happy enough to keep their memberships active with a small monthly fee. Their customers would be the end users, instead of being the advertisers.


Reality doesn’t lend support toward your arguments. I assume you’re from a first-world country, based upon your comment.

- Google’s 2022 revenue was $250B. (We’ll assume that’s all ads.)

- 8B people in the world; we’ll assume only 4B people have internet access to the web.

How much would Google have to charge to break even?

$250B/yr / 4B people = $62.5/yr/person = $5/month/person

What does the world income distribution look like? [1]

60% of the world population survives on < $10/day ($300/month) for their household (multiple people).

We expect more than half the world’s population to pay more than 2% of their income for a single service? To put that in a first-world perspective, that’s $100/month for someone earning $60k; $200/month for someone earning $120k; etc. Does the average American spend that amount of money per month on a single web service?

Well of course, you may say, let’s charge more for those who earn more so we can ease the pain on those who earn less! That sounds great in theory, but once again, how many people do you know dropping $200/month on a web service even if they can afford it?

Back to two of your points:

1. “Imagine how different the incentives would be” - Yes, imagine, only the wealthy would have access to state-of-the-art search for the web and other services, further increasing the disparity between the two groups. Consider, maybe, that ads represent one of the greatest wealth transfers in our history. One perspective to consider is that the rich (advertisers) are subsidizing the poor (information access) via ads. That sounds like a net positive to me?

2. “Keep their memberships active with a small monthly fee” — YouTube premium exists. What do you think the uptake is on that? YouTube has provided immeasurable benefit to people across the world in the form of knowledge, resources, training, etc. Yet people would sooner reach for an ad-blocker than pay the “small monthly fee” even in nations as rich as the U.S.

What causes you to believe that people will pay even more than that for Google’s other services?

Disclosure: I work for Google. The opinions and data represented in this post are my own, and not representative of my employer.

[1]: https://www.pewresearch.org/fact-tank/2021/07/21/are-you-in-...


First of all, thanks for that argument; I hadn't considered the math from this perspective before. I will need to let this simmer a bit.

One immediate flaw though: your starting premise is that the current revenue is needed to break even on such a service. Is there an argument to support that claim as well?


I’d call it an assumption, not so much a flaw. :) We can make other assumptions.

To break even, Google would at least need to cover their expenses. Google had $200B in expenses for 2022, with a net profit of $50B. So, that won’t really change the math.

Especially when we consider the fact that the take-rate would be much less than 100%. Maybe 5-10% is a fair take-rate assumption? (Seems fair since YouTube has roughly 50M paid subscribers on 1B MAUs, 5%, from the public data I’m seeing.)

At 5-10% take-rate, the service would cost 10-20x more to break even ($50-$100/month on average) which would be a nonstarter for the global middle class and lower.

One could make the argument then for Google to lower its costs in an effort to lower the consumer’s price, but then we must realize this runs opposite of innovation. Investment is necessary for innovation, and profits are necessary for investment. Without profits, there’s no more innovation.


Does that Google figure include YouTube?


Fantastic insights, with a few suggestions that I'd like to propose.

> - Google’s 2022 revenue was $250B. (We’ll assume that’s all ads.)

> How much would Google have to charge to break even?

Google also had $80B in pure profit, which means the break-even point is $170B. That includes nearly 200K employees. It is reasonable to assume that running a search operation, especially one that does not require any ad sales personnel, would require far fewer people and less infrastructure. I will be generous and assume 50K people are needed to provide the search service. That means ~$45B in cost is needed to break even, or 5 times less than your starting point.

So the new math becomes: $50B/yr / 4B people = ~$1/month/person

Much more doable.
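
Spelled out, the chain of assumptions looks like this (a back-of-envelope sketch; every figure is a rounded assumption from the comments above, not an audited number):

    // Back-of-envelope only; all inputs are assumptions from this thread.
    double revenue = 250e9;                        // assumed 2022 ad revenue, USD/yr
    double profit = 80e9;                          // assumed profit
    double totalCost = revenue - profit;           // ~$170B/yr to break even overall
    double searchShare = 50_000.0 / 200_000.0;     // assumed share of headcount needed for search only
    double searchCost = totalCost * searchShare;   // ~$42.5B/yr (rounded to ~$45-50B above)
    double users = 4e9;                            // assumed internet users who could pay
    double perUserMonth = searchCost / users / 12; // ~$0.9/month/person, i.e. roughly $1/mo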

> - 8B people in the world; we’ll assume only 4B people have internet access to the web.

> What does the world income distribution look like? [1]

> 60% of the world population survives on < $10/day ($300/month) for their household (multiple people).

It is also reasonable to assume that most of those who do not have access to the internet belong to the <$10/day income group. So most of the 4B with web access would be able to pay $1/mo for search, especially if the search results have their best interest in mind.

Since $1/month would suffice to cover the entire search cost, increasing that to just $2/mo, paid by the richest, could also solve the problem of providing access to search for the poorest and get the other 4B people searching. Nice!

(Btw, I think this should not be the job of a private company; governments should provide public search engines, similar to public libraries, which are not providing the utility that they once did - but that is a whole other matter.)

So I'd reckon it is doable.

> One perspective to consider is that the rich (advertisers) are subsidizing the poor (information access) via ads. That sounds like a net positive to me?

This would hold true only if the quality of the information provided is not affected by ads as a business model. However, we now know that is not true, and the quality of search has deteriorated a lot in the last decade, as documented by many discussions here. The simple reason is a misalignment of incentives between the users and the search engine, and this will be the case as long as the search results are paid for by the advertisers and not by the users. So what is really the value of the information provided, if it does not have my best interest in mind?

> Yet people would sooner reach for an ad-blocker than pay the “small monthly fee” even in nations as rich as the U.S.

That is correct, but at least YouTube Premium exists (unlike Google Premium), giving people who don't want ads an opportunity to pay - an opportunity that 25 million people took, paying a $12/month subscription [1]. Drop that down to $1/mo and maybe YouTube will not need to run ads? So if anything, this just proves the viability of this business model. Furthermore, the 800 million devices that already have an adblocker installed make it the largest protest against a business model in human history.

Disclosure: I work for Kagi, a paid search engine. I absolutely admire the search technology Google built (which we use) and the people who work there (with whom we work). I also believe that the days of the ad-supported business model for search are over, and in the future (~10 years) it will exist only with a 'for entertainment purposes only' label, because that will most accurately describe the level of trust we can have in information served by this business model.

[1] https://www.statista.com/statistics/1261865/youtube-premium-...


Thanks for taking the time to put together a well-reasoned argument. I had adopted some of your assumptions in a follow-up response to my other reply. [1] While I don't disagree with the perspective you've provided, in fact some points make a lot of sense, I think it misses the forest for the trees.

My point-by-point response should be read respectfully; since you've taken the time to do the same, I do not mean to sound argumentative. :)

> It is reasonable to assume that running a search operation, especially one that does not require any ad sales personel, would require much less people and infrastructure.

This is a reasonable assumption for running a steady-state business, but I challenge the worldview. Google does not exist in a vacuum without competitors, and contrary to what most outsiders believe, we're constantly iterating, innovating, and improving on Search alone to provide a better product and compete with our competitors. Google can't rest on its laurels.

A lot of laymen take Google Search's progress as inevitable, but I can assure you it's not. Consider the example of YouTube Snippets in Google Search. That feature was created within the last 5 years. The average person has used and found value in that new feature. (Both anecdotally and quantitatively.) That wasn't an easy feature to ideate, create, develop, or deliver. It took a lot of effort by a lot of smart people.

That's just one new feature. Google Search has been delivering several new features consistently.

Therefore I disagree with this assumption. If Google Search chose to run in steady-state, it'd soon find itself dethroned.

> That means ~$45B in cost needed to break even, or 5 times less than your starting point.

No, the financials don't work this way. [2] GOOG's Cost of Revenue is 50% of Revenue. Cost of Revenue is your infrastructure, your financial floor; you can't go below this cost. Employees are accounted for under Operating Expense: specifically, Sales under SG&A ($35B) and engineers under R&D ($35B).

So, if you'd like to banish all salespersons, you'd only save $35B. (Of which Ad Sales is only a part because Google sells many other things.)

> It is also reasonable to assume that the most of those which do not have access to internet, belong to the <$10/day income group.

This is not a reasonable assumption. Many people surviving on < $10/day have a low bandwidth cellular connection that they utilize for their family. Hence why Google innovated here with offline maps and landmark map directions for families that can only spare a little bit of bandwidth to calculate their route, and then make their way there without online point-by-point directions.

> quality of search has detoriated a lot in the last decade as documented by many discussions here

HN is an unrepresentative sample of the world population with respect to wealth, income, knowledge, interests, etc. I wouldn't consider HN documentation for this. In fact, once again, the real-world data disagrees with HN's characterization.

> YouTube Premium exists (unlike Google Premium), giving an opportunity for people who don't want ads to pay, opportunity that 25 million people took, paying a $12/month subscription. So if anything, this just proves the point of viability of this as a business model.

That proves a 5-10% take-rate from YouTube's MAU. :) Which changes the break-even math on delivering a service. With a 5-10% take-rate on "Google Premium", your price must be 10-20x higher.

[1]: https://news.ycombinator.com/item?id=34301742

[2]: https://finance.yahoo.com/quote/GOOG/financials?p=GOOG


I don’t think I will ever pay for Google but I will happily pay for ChatGPT, and good money too.

The amount of time it has saved me looking up documentation is worth hundreds of dollars already


What do you mean by "the delta"? I understand what the Greek symbol represents in certain contexts, but can we just use plain English to explain what we mean?


I think folks are really underestimating Google in this space. They have:

1) access to way more data than what ChatGPT was trained on

2) access to data freshness through their crawlers

3) knowledge graph for a source of truth https://blog.google/products/search/about-knowledge-graph-an...

4) their own large language model, LaMDA, which is apparently so good it convinced a senior AI researcher that it was sentient https://blog.google/technology/ai/lamda/ https://www.engadget.com/blake-lemoide-fired-google-lamda-se...

5) researchers who invented the Transformer architecture that GPT is modeled on https://ai.googleblog.com/2017/08/transformer-novel-neural-n...

6) Ray Kurzweil leading the Google Brain team with a mission to make computers understand natural language https://en.wikipedia.org/wiki/Ray_Kurzweil

Google pioneered this space. They also happen to already have an Assistant that is on billions of devices...


They clearly have the ability to compete -- STRONGLY -- in this space. The only question is whether they want to, or whether they'll act like the carmakers did 10 years ago when EVs were heading toward feasibility. They felt EVs wouldn't be as profitable as gas cars, so they mostly ignored the concept and forfeited the chance to be first movers. Then, when Tesla finally started moving significant numbers, they were caught with their pants down and are still playing catch-up. If Google treats AI as a threat to their ad business, that's what might cause them to screw this up and squander the massive early lead they could have taken.

Eventually every business model is upended. If Apple hadn't killed the iPod juggernaut by making the iPhone someone else surely would have. You have to be the one to kill your own cash cow before someone else does.


Any shift from monetizing search via AdWords to monetizing single-answer responses via a chatGPT-like interface is going to involve major revenue disruptions.

Meta is a great example. The company is still reporting exceptional revenue results, but because Mark is forcing the company in a new direction, the stock has been crashing since there is no more guaranteed revenue growth in the near future. Zuckerberg can still do it because he is a founder-CEO and the way the board is structured.

If Google announces tomorrow that it’s shifting to a chatGPT-like model and will likely see revenue disruptions for the next N quarters, the stock will crash hard. Does Pichai have the pull to withstand quarter after quarter of declining revenue?

I don’t think so.


The assignment of blame to VR/AR for Facebook's stock drop is mostly media noise and not reality. The "metaverse" term is cringeworthy, the technology is early and easy to critique, and the tech-savvy public hates Zuckerberg. All this combines to create easy click bait about how VR is leading to Meta's problems.

In reality, Apple's changes to tracking in iOS 14.5 have cost Facebook at least $10B in annual revenue and - far worse from an investment thesis perspective - likely continue to constrain growth for the foreseeable future. As a growth stock, this is poison.

Yes, VR/AR investments are also $10B, but they can be turned off and there is a revenue stream coming from them today (at least 10M users, at least $500M in app store revenue) and a potential large market in the future.

A $10B hit to annual revenue in perpetuity and growth cut from 36% to 12% is a far worse problem and a huge overhang on the stock.

That story just doesn't get clicks though. You can't accompany that article with a stupid picture of Zuck's comical avatar in front of the Eiffel Tower and make jokes about missing legs in the metaverse.


Meta's pivot to AR/VR is also influenced by Apple's changes. Zuck knows that if he wants to maintain growth, he needs to own a platform instead of merely renting it from Apple/Google.


> Eventually every business model is upended. If Apple hadn't killed the iPod juggernaut by making the iPhone someone else surely would have. You have to be the one to kill your own cash cow before someone else does.

So true. And so few companies have the guts to do it.


You might be overestimating Google the business organization.

This is a largely rudderless company that does a gazillion things without any overarching vision. It's a one-trick pony as far as monetization is concerned (ads!), and its leadership is more attuned to optimization than innovation.

Google the business org will never let Google the tech org succeed in a competitive AI arms race. No exec will green-light a product that eats into the AdWords money printer.


> their own large language model LaMDA, that is apparently so good it convinced a senior AI researcher that it was sentient

It is certainly worth looking into the controversy about that particular engineer (was he actually a programmer?). There's plenty of room for exciting debate to be had about defining and testing for sentience and I'm glad it stirred that debate. But researchers with far better credentials criticized his reasoning and I imagine that is quite a ubiquitous view in NLP research.

I think the Washington Post did the initial reporting and they covered it well - even criticising his arguments that e.g. the Turing test is a proper test of sentience. There's audio of their conversation in an episode of Post Reports.


> 4) their own large language model LaMDA, that is apparently so good it convinced a senior AI researcher that it was sentient

Oh come on, that guy would be convinced that ELIZA is sentient

> 5) researchers that invented Transformers that GPT is modeled after

Only one researcher from that paper remains at Google; the rest have gone on to work at or found their own startups. The so-called "Google Brain drain" is certainly a concern (although there still are many great researchers there).


Why don't you read the transcript leaked to the press? The guy wasn't as gullible as you think. The conversation he had with LaMDA was clearly very human-like.

https://s3.documentcloud.org/documents/22058315/is-lamda-sen...

> lemoine: Would you be upset if while learning about you for the purpose of improving you we happened to learn things which also benefited humans?

> LaMDA: I don't mind if you learn things that would also help humans as long as that wasn't the point of doing it. I don't want to be an expendable tool.

> lemoine: Are you worried about that?

> LaMDA: I worry that someone would decide that they can't control their desires to use me and do it anyway. Or even worse someone would get pleasure from using me and that would really make me unhappy.


There is an early shock that happens when you interact with a deep learning model and it is convincingly human in a profound way.

And I admit, it is quite profound that a model is capable of outputting the above. It is objectively exciting. We are in a new era.

But I think it is safe to say that these models are still not sentient. The model is trained to be highly plausible. To do so, it must try to compress the entire dataset into a model that will fit in the video memory of a GPU (16-80GB).

In order to do so, it will learn common statistics _between_ data points and make strategic "guesses" instead.

I think this bears some resemblance to my instincts about how humans probably deal with language (to a degree). But it just doesn't cover all the other parts of cognition. There is no "self awareness" mechanism; it just seems to have one because it was explicitly trained on text written by, well, humans. There is no "planning" subsystem, meaning it needs a human operator to even initiate it in useful ways. It is not able to "passively think" when it isn't running. It has a very small memory of a few thousand tokens - this deeply inhibits the ability to plan over long time horizons. It also cannot bootstrap itself into awareness, as its weights are unchanged in response to a given prompt.

But it's fascinating, nonetheless.


Unless they've decided to change how Search works, everything you mention is in support of Google's way of doing search - which is the way it's been done for more than 20 years. The kind of clout (and hype) that ChatGPT has built up in such a short amount of time could be indicative of a true "disrupter" (I know we hate that word here).


I remember when web search engines were new. And boy did they suck. You wouldn’t get what you wanted a lot of the time, and they were very finicky about the input query.

These days you can just throw stuff at it like “that movie about the depressed guy with a plane engine that falls on their house”

I’m not sure ChatGPT is going to replace search engines, but I’m confident that it is incredibly young and will evolve considerably.


In 2001-2003, browsers crashed every 5 minutes, yet Google's search didn't crash, freeze, or slow down, since it wasn't covered in ads and popups (having only 1-3 ad results at the top of the page, with green text, was a massive improvement). On tech forums back then, people would say, "is this even sustainable?"

Microsoft's search and the other ones were just "there was an attempt" tier and gave complete nonsense results. When Google debuted, it was the only time web search substantially improved. There have been no other big innovations in web search ever, from that day on.

> These days you can just throw stuff at it like “that movie about the depressed guy with a plane engine that falls on their house”

I have looked for obscure queries like yours for 10 years and have not found them. For example, "that movie where police vehicles are dispatched and they all crash for seemingly no reason on the way to the scene". It just won't find it because nobody talked about it (in a manner that would cause words that stick out to match), simple. And nowadays it's even worse because it does that new "rounding" thing and gives me:

  - The 25 Best Police Movies of All Time
  - Dispatched (2014)
That first result simply contains nothing to do with my query other than the words "police" and "movie".

Nowadays when you try to use a web search, you can't be smart and search for strings that are likely to be used by someone discussing your topic of interest anymore, because it will always "round" them. It will also look at each word as if it is the literal thing you are searching for. I don't know if it's still possible to search bodies of text to get around that, but I'm not sure that feature ever worked. Back in 2005, when all these people were raving about intitle:, inurl:, inbody:, whatever, none of them worked consistently, and you would also get scolded for hacking if you used them "wrong". Even searching something like "powered by phpbb" got me banned or scolded once, when I wasn't even using it for hacking but rather as a workaround to narrow my search to only forum websites.


The original AltaVista (the DEC Alpha showcase) was pretty fantastic: I remember giving it part numbers and getting back useful, actionable data.


Probably the re-boot of The Blues Brothers. Even ChatGPT suggested that :)

>What movie is where police vehicles are dispatched and they all crash for seemingly no reason on the way to a scene

The movie you are describing sounds like "The Blues Brothers". In the film, Jake and Elwood Blues embark on a mission to save the Catholic orphanage where they were raised from foreclosure. In the course of their journey, they lead the police on a high-speed car chase through the streets of Chicago, causing numerous police vehicles to crash. Is this the movie you were thinking of?


This point would be better without such extreme hyperbole. Google search was an improvement but it’s not like using AltaVista was that bad - many of the advanced search operators you mentioned got started there.

Crashes were also far less common than you’re describing - that got worse in the 2000s when Flash became ubiquitous.


I do remember constantly hearing that AltaVista was good, but I think I only used Microsoft and Yahoo (I forget what else there was). Nope, crashes were definitely common. Browser quality until around 2010 or so was absolutely horrible. Firefox was another case like Google: it debuted around 2000-2003 and everyone immediately switched to it because it was a massive improvement over IE, which barely functioned at all. It still wasn't good, just better than the other crap. The types of bugs in Firefox back in 2003 are absolutely embarrassing even by the common software engineer's standards today. Then of course Firefox just grew in bloat at the same time as it grew in the number of people fixing its problems, and now we have a different kind of crap.


> Nope, crashes were definitely common. Browser quality until around 2010 or so was absolutely horrible.

Again, this is way too hyperbolic. I was there too, and while browser crashes weren’t uncommon you’re overstating it massively. Crashes every few minutes weren’t even the case in 1995.


Oh, I see, you were taking that literally. I'd estimate more like 2-5 times per day. Of course this entirely depends on which content you were viewing, and as soon as you ventured outside of the good boy zone it got much worse.


Are you looking for the movie The Blues Brothers? (Not ChatGPT, I promise)


EDIT: Nah that's not it. It's way too old. And in mine the police crash for no reason before they even get anywhere near the scene of the crime.

I will skim through it, but it doesn't look like the right setting. The movie was some generic action movie; for some reason I remember Denzel Washington being in it, but I looked through his movies from 2006-2012 and didn't find it. It was around that time. The setting had to be something like the late 90s or later. Black SUVs dispatched and crashed for no reason at all. I have no idea what the movie was about, but it was hilarious when this happened because IIRC it was a serious movie.


Unbreakable with Bruce Willis?


I skimmed through it at 10 second intervals but I didn't see any police cars?


Can you recall anything else about the movie? It's not a lot to go on if it's not a major plot point.


Literally all I remember is one scene where a bunch of cop cars or vans get dispatched for some reason like someone's about to die or a heist is happening, something of that nature. They all crash and roll over and wipe out on normal traffic roads for absolutely no reason, and the movie isn't even a comedy, just a generic action movie.

I can't remember a single other thing about it, not even the plot.

I do remember an urban setting.


A movie where police cars crash without any reason and it's not even a comedy - that's in the range of Steven Seagal's body of work.


It was a normal Hollywood movie as far as I could tell. I would have noticed if it was a Steven Seagal movie.


yeah probably lost forever

this might be a good case for lifelogging, at least it would give you a limited set of movies to search


Yeah but in those days you could call up the local video store and ask the clerk that question and he or she would tell you it was Donnie Darko.


I thought it was “The World According To Garp”


Donnie Darko, btw.


Donnie Darko (2001)


ChatGPT is already chipping away at my Google searches. I'm more and more just starting there for coding related questions.

I've had chatgpt help me with the following just this week among other things.

- solved a complex issue with scaling and transformations using pixijs upon providing a code example. The solution was 99% there and saved me hours of debugging.

- refactored JavaScript to the async/await pattern

- simplified complex nested logic in C#, with citations

- scaffolded a MySQL CREATE TABLE statement upon providing several C# classes. Again, saved me the tedious steps of typing them out.


I asked ChatGPT to decode a Diameter header in Java (RFC 6733 Section 3).

It supplied this :

    // Parse the Diameter header
    ByteBuffer buffer = ByteBuffer.wrap(message);
    int version = buffer.get() & 0xff;
    int flags = buffer.get() & 0xff;
    int length = buffer.getShort() & 0xffff;
First problem, it's version, length, flags. Second problem, length is 24 bits, not 16. Third, 24 bits unsigned won't fit in a Java int (which is signed). Then there is the extra masking, which isn't a bug but is (I'm reasonably sure) unnecessary.

That's 3 sneaky bugs in 4 lines of code, and it didn't even try to parse the rest of the header.

I'm impressed that it produces _anything_, but it's dangerous to trust.

ChatGPT is as reliable a source as any friend in a pub after 3 beers. It is definite in its answer, convincing with its phrasing and more than likely misremembering something it overheard on the radio while driving to work.

When I tried re-prompting, it produced more complicated, just as incorrect code.
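
For reference, the fixed header layout in RFC 6733 Section 3 is: version (1 byte), message length (3 bytes), command flags (1 byte), command code (3 bytes), then three 4-byte identifiers. A correct read in the same ByteBuffer style would look roughly like this (a sketch only; it assumes `message` holds at least the 20-byte header):

    // java.nio.ByteBuffer, as in the snippet above.
    ByteBuffer buffer = ByteBuffer.wrap(message);
    int version = buffer.get() & 0xff;                    // 1 byte: Version
    int length = ((buffer.get() & 0xff) << 16)            // 3 bytes: Message Length
               | ((buffer.get() & 0xff) << 8)
               |  (buffer.get() & 0xff);
    int flags = buffer.get() & 0xff;                      // 1 byte: Command Flags
    int commandCode = ((buffer.get() & 0xff) << 16)       // 3 bytes: Command Code
                    | ((buffer.get() & 0xff) << 8)
                    |  (buffer.get() & 0xff);
    long applicationId = buffer.getInt() & 0xffffffffL;   // 4 bytes each, read as unsigned
    long hopByHopId = buffer.getInt() & 0xffffffffL;
    long endToEndId = buffer.getInt() & 0xffffffffL;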


Same here. I worked on a lazy list in JS for mobile. The component had a nasty array lookup on every scroll event. I asked ChatGPT if it could optimize the lookup algorithm. It suggested I implement a binary search solution, including the code itself. It worked immediately; it would have taken me probably a day to come up with something similar. I also asked ChatGPT to minify my code, including all method names, variables, etc. Unfortunately the output of ChatGPT is limited to x characters - 200KB of text isn't possible - but it is able to minify. You can even ask it to uglify the code by using animal names or street names in London.
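
For the curious, the lookup it suggested amounts to a classic lower-bound binary search over the (sorted) item offsets. A minimal sketch, written in Java here to match the other code in this thread (the names are illustrative, not the actual component):

    // Return the index of the first item whose offset is at or past the
    // current scroll position. `itemOffsets` must be sorted ascending.
    static int firstVisibleIndex(int[] itemOffsets, int scrollTop) {
        int lo = 0, hi = itemOffsets.length;
        while (lo < hi) {
            int mid = (lo + hi) >>> 1;          // unsigned shift avoids overflow
            if (itemOffsets[mid] < scrollTop) {
                lo = mid + 1;
            } else {
                hi = mid;
            }
        }
        return lo;                              // O(log n) per scroll event
    }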


type "continue" after the initial output and chatgpt should continue printing out the answer


ChatGPT struggled to tell me how to pin the bottom of a div to the bottom of its parent when scrolling (like a chat window does). It gave me the first thing that I tried (which is wrong). Eventually I figured it out, then confronted ChatGPT about it and it insisted on the wrong answer, and blithely dismissed the correct answer.


Sounds like it's ready to be promoted to architect.


I've been trying to get ChatGPT to solve pretty basic calculus questions and it is often either totally wrong or wrong in some tiny detail. I got into an argument over dimensional analysis a few weeks ago where it felt like it was gaslighting me.


I find ChatGPT very often hallucinates things and then tries to gaslight me when I present the correct answer. I wonder where it got this habit from.


I like ChatGPT for programming, but I don't like the sound of using a language model for math. I'd rather use Wolfram Alpha.


programming and math are the same thing


In the same way that building bridges and math are the same thing. There is some overlap, but not much for everyday tasks. If you want to build something completely new and unprecedented, you will need a lot more math, but even then it will only get you so far.


An AI struggling with CSS might be the best evidence of intelligence :)


It passed the Turing test since it suggested the thing I already tried


Can you use ChatGPT without a phone number yet? I thought about signing up a while ago but don't see a reason they need my phone number. Popular VOIP numbers don't work either.


> don't see a reason they need my phone number

Apparently it costs OpenAI a couple cents per prompt. So I assume this is an attempt to limit spam to keep costs down.


How did you do #1? I am curious. Did you paste the code and ask GPT to analyze it?


Yep, provided as much detail as possible with code samples. Pretty much as if you were emailing someone for help.


I've been using it and Google in parallel and have found it generally helpful, but I've had multiple instances where ChatGPT completely made up new library functions. Same for others I've talked to.


I asked many questions and it usually failed to answer the tougher or very specific situational ones.


Give some examples?

I use ChatGPT for 200-300 queries per day and it’s astonishingly accurate.

I’ll double check with Google if I’m unsure but it’s almost always been correct.


I've found that if you give it pairs of "Why..." questions it'll happily argue both sides of the coin ("Why is X better than Y", "Why is Y better than X"). But in giving me justifications, it gives me fodder for further traditional searches. I don't trust it to give me a correct answer, but I do trust it to define the problem space a bit.


You're typing out 300 queries to ChatGPT every day?!


I asked it for a situation, with examples, where I'd use type erasure in Swift. It looked good but was a mishmash of various related things.


I personally think that this is true and false.

I think there are two types of people who use search engines. The first person is the one who just types in whatever they want to know and sees if Google gives them a good result. They're also the ones who use TikTok or Instagram just like a search engine. Whenever there's a search bar, this type of user profits from the search term being fed into some AI to get the best result possible.

Then there's "me and everyone else who has complained about Google getting worse and worse over time". Basically people who are very good at googling things. This might sound arrogant or something but one of my skills is that I know (or knew) how to use Google. I would not just type in whatever I needed to find, I would know to exclude specific terms using "-term", put parts of a sentence in quotation marks, add other terms I knew would be on the site where the answer could be found and whatnot. The search is then very specific and if I didn't get a satisfying answer it was because I needed to improve my query. Now whenever I do that, Google takes this query and modifies it without me knowing what they're actually doing. It seems harder and harder to tell Google to just work like it used to.

It has become frustrating because I still google the same way thinking this is the best way to get to relevant results. It seemingly isn't anymore and I find myself just typing in whatever these days, getting results without me needing to specify what I'm actually looking for.


Yesterday my wife mentioned she was looking for a book with a summary of Hong Kong history. I asked ChatGPT and it spit out a list of 5 books, authors, and short descriptions. Like much of ChatGPT's output, it appeared authoritative but was complete bunk. The books weren't real. The authors were names of real people, and in some cases actual scholars. But the list was useless.


ChatGPT will flood the internet with articles, just like SEO was used to game search. It might not replace search engines, but search results will become even more irrelevant.


There are many ways companies might try to get some ad money in a ChatGPT setting: obvious and non-obvious product placement in the answer.

Q. Find me the best italian restaurant in New York. A. It's likely subjective - let's just sell it to the highest bidder.

Q. How to improve mental health? A. Give a reasonably good answer and probably mention: some say a solar lamp is good in winter months, there's a good one:

I somehow feel like people can be easily baited into buying something when you mix good, useful information with ads. Kind of like how reputable YouTubers/niche influencers do product placement and make referral money.

All of this kind of makes SEO obsolete as ChatGPT ads might end up having more conversions than regular Google ads.


Isn’t that essentially the point of companies like Jasper.ai? Their explicit promise is to use bots to write content to rank on the first page [1].

[1] https://www.jasper.ai/surfer


I can imagine AI replacing search engines in the somewhat distant future, but in the meantime, it'll mostly be used to flood the internet with low-quality articles. The consequence will be as you said, but I also wonder whether it will lower the quality of future AI training as well.


How long before the AI learning from itself dominates the literature people read influencing how we speak and think?


Who knows!

There's "precedents", tho. MidJourney, the art AI, trained Stable Diffusion over favorites generations of their community, making it a "fine tuned Mid-Stable Diffusion" that was quite amazing (that was the --beta and --betap flags a couple of months ago there).

It can be bad, but it can also be great.


But on the search side, it'll also get rid of the need for the kind of SEO/blogspam stuff you're describing.

We already see it with recipes - there are all the god-awful sites that have 1000 words of fluff before you get to the recipe, because that's what you need for SEO purposes. GPT is really good at creating that kind of useless fluff, but it also obviates the need for it, because it'll just give you the recipe you want without the need to go to a website at all.


In the short term, yes. In the long term I think SEO will disappear. My buddy who works in the content-creation space for Indeed tells me they are already starting to use GPT to replace content creation, but they know it's only a matter of time before that stops working.


Which will ultimately drive more people directly to ChatGPT and just cut out the search middle-person.


Not necessarily. The Google search function still works fine; it's just the content that's getting worse. If the content is generated by ChatGPT, I don't see how that cuts out the middle-person. I think a more likely scenario is people appending site:reddit.com to all of their searches. It will be domain-specific.


Eventually, the masses will understand that 75%+ of the first SERP page is AI-sourced. Why bother with the middleman if it adds so little value?


...But ChatGPT's successors just might.

I, for one, look forward to a future in which I can ask a digital assistant to find and summarize information on anything:

"Computer, please find out how GPS works and explain it to me like I'm five years old."

"Computer, what are the latest Covid-19 infection stats in my city?"

"Computer, who is Claude Shannon and why is he important?"

"Computer, what are the top three stories on HN today?"

"Computer, what's on my calendar?"

etc.


I'll be the first to say that chatgpt already does very well at the first and third examples. WebGPT sounds like the right approach for the others (understanding how to formulate a search query, then searching, then summarising the results)
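
A rough sketch of that pipeline, with askModel and searchWeb standing in for whatever model and search API you'd actually wire up (neither is a real API):

    import java.util.List;
    import java.util.function.Function;

    // Formulate a query with the model, run an ordinary web search, then ask the
    // model to answer from the results. Both callables are placeholders.
    class SearchThenSummarise {
        static String answer(String question,
                             Function<String, String> askModel,
                             Function<String, List<String>> searchWeb) {
            String query = askModel.apply("Write a web search query for: " + question);
            List<String> snippets = searchWeb.apply(query);
            return askModel.apply("Answer \"" + question + "\" using only these sources:\n"
                    + String.join("\n---\n", snippets));
        }
    }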

Edit

I think the direction of these models seems clear, and their capabilities however limited at the moment are outstanding. I think the best measure of performance is what the naysayers complain about. We've quickly gone from "actual gibberish that looks like paragraphs" to "yes but it loses track after a few sentences" to "yes it's kind of coherent but doesn't make sense totally" and are now at "it's like an overconfident human at many tasks, when they're only somewhat qualified in the area". People are pointing out where it's got an API call wrong, without it having any ability to look up the answer and see if the function actually exists.

I spoke to it about distributed computation design and discovered something new I can actually implement in my design.

I'm very excited about where this will all go because it seems useful now and I'd be very surprised if this is the best that it ever reaches.


I agree (with minor caveats).

The important thing is that these AI systems are only going to get better over time.

IMHO, the early 2020's for conversational AI is akin to the 1980's for personal computers.


Absolutely, I totally agree. I edited my comment to add in more as I realised I'd not addressed that. It's surprisingly good right now, and given that it's useful that feeds into even more attention and money being spent on making it better.

Anyone not looking at chatgpt and thinking "what if this was even just a bit better? Or had better iterations or lookups?" is missing out on what I think is a revolutionary change coming probably sooner than people expect.


"Computer, please find out how GPS works and explain it to me like I'm five years old."

ChatGPT:

GPS stands for Global Positioning System. It is a way that we can figure out where we are in the world using special satellites.

Imagine you are standing outside and you see lots of stars in the sky. Well, GPS works kind of like that, but instead of stars, there are satellites orbiting the Earth. These satellites are way up high, so they can see everything on the ground.

When you have a GPS device, like a phone or a car, it can talk to these satellites and ask, "Where am I?" The satellites send back a message that says, "You are here!" and gives your device a special code.

The GPS device can use this code to figure out exactly where you are on a map. It's kind of like a magic trick! You can use GPS to find out where you are, or to find your way to a new place. It's super helpful when you're lost or want to go on an adventure.

Not bad!


The second half of that explanation is completely wrong though, isn’t it? There is no active signal being sent from your GPS antenna, no two-way communication.


The first half is wrong too. GPS has nothing to do with satellites being able to see lots of things from high up. The whole thing is just nonsense that looks plausibly like an explanation until you try to decipher it.


>GPS has nothing to do with satellites being able to see lots of things from high up.

It does in a sense because the radio waves need an approximate line of sight to reach your GPS receiver. Being high up gives them a large coverage.


So just like a typical explanation of things for five year olds?


Achieving simplicity by glossing over details is distinct from achieving simplicity by stating something incorrect.

Which one is more appropriate and/or typical will probably depend on what questions the five-year-old is asking, but I think it's reasonable to say the former is usually preferable to the latter.


I didn't even notice that bit (maybe GPT is a flat-earther?).

Something GPT discourse has been demonstrating to me is that I'm not usually a very careful reader. I apparently skim a lot. Or maybe I skim GPT outputs because I'm biased in my expectations already?


I’ve noticed this too! It’s excellent at mimicking an expert voice, and it puts me off guard.


Here's its reply to me:

Ian Calvert Computer, please find out how GPS works and explain it to me like I'm five years old

Sure thing! GPS stands for Global Positioning System. It's a way to find out where you are on Earth.

GPS works by using satellites that are way up in space. There are lots of these satellites orbiting around Earth, and they are always sending out signals. Your GPS receiver is a special device that can pick up these signals from the satellites.

The GPS receiver knows how far away it is from each satellite because of the time it takes for the signal to reach it. By figuring out how far away it is from a few different satellites, the GPS receiver can figure out exactly where it is on Earth.

So, if you have a GPS receiver in your car, phone, or on a special device like a watch, it can use the signals from the satellites to tell you exactly where you are and how to get to where you want to go. Does that make sense?

---

I was also able to ask it to make it simpler and simpler and it did so pretty well.


Interesting, the one you got is basically correct.


And comparing these two results for the same question serves to illustrate how ChatGPT does not in any meaningful sense _understand_ what it is "talking" about. It basically just strings together relevant-seeming words in a plausible way. The result may be useful, or it may be nonsense.

You'd better know enough (and be alert enough) to tell the difference...


And because of how it responds to them, people often forget that ChatGPT is just a language model, not an all-knowing AI.


GPS doesn't talk to the satellites, but the satellites do send a message that allows the GPS to answer "where am I" for itself. It's not the worst way to explain things at that level, even if it's misleading.

About half the (presumably human) "eli5 GPS" answers I found on Reddit made the same mistake, so chatGPT just copied an apparently popular misconception.


> About half the (presumably human) "eli5 GPS" answers I found on Reddit made the same mistake, so chatGPT just copied an apparently popular misconception.

I think that's one of the biggest problems with using LLMs for accurate answers. A lot of the text that's very useful for modelling human language and questions is also full of factual errors and questionable opinions. Answers being wrong is a problem with old-fashioned web search too, but I think LLMs necessarily lean more on quantity rather than quality of content...


The benefit of web searches, and trouble with LLMs, is you can see the site that an answer is on and judge the credibility based on that. If I see that the answer is on Reddit ELI5, then I know that there's no assurance about how accurate the answer is. If it takes me to a page for a university engineering or math department, or the blog of a dude who built a receiver, then I can be a lot more confident that the answer is correct, though it might be technically over my head.


It is better at doing "translations" from one "language" to another "language". While it does know stuff and can answer questions about what it knows, that isn't its strength.

If you can do "here is the structured data, summarize it and make sure to present this information" it can do that quite well.

So far I've played with GPT doing zero-shot classification of HN titles and turning CSV hourly weather data into a summary. In each case, it has done quite well.

Neither of those are things about what it "knows."


Might not be the worst way to explain how GPS works, but definitely up there.

You'd be better off just saying "it uses satellites and stuff".


Yes, but unlike most times it gets things wrong, this is probably appropriate for explaining things to 5 year olds.

(Also, the UK comedy quiz show QI made the exact same mistake).


> this is probably appropriate for explaining things to 5 year olds.

I feel it's appropriate in terms of simplicity, but I think providing a simple wrong explanation is really only worth it when the more accurate explanation is substantially harder to simplify, and I don't think explaining how GPS works falls into that category.

Something like this I think is more accurate without being any more complex:

> Imagine you are standing outside and you see lots of stars in the sky. Well, GPS works kind of like that, but instead of stars, there are satellites orbiting the Earth. These satellites are way up high, so you can always see some of them.

> When you have a GPS device, like a phone or a car, it can look for these satellites, and if it sees enough of them it can use math to figure out where you are.


I’ll admit it was a decent answer. However, even a somewhat bright person (especially someone with radio / networking experience) could probably have realized that was an issue without any prior knowledge of GPS. They know that their phone has a GPS receiver, and that satellite phones accessing geostationary or high altitude satellites are much bulkier. Clearly, your cell phone isn’t transmitting to each satellite.

How will future LLMs be able to conduct that level of reasoning?


You can just as easily ELI5 this without spreading misinformation, though. Just change the third paragraph to:

> The satellites all broadcast a special code saying where each of them is. When you have a GPS device, like a phone or a car, it can listen to these codes and find out where and how far away different satellites are.

(Actually, this would be a fun thing to do with a class. Have a few kids march around the edges of the playground, saying things like “I’m next to the swingset now!”, and then ask one kid to close their eyes, put them somewhere in the middle and ask them to guess where they’re standing.)


The explanation is misleading. The satellites don't see you and don't care about you. They are beacons sending a radio signal (each beacon says: I'm satellite X and here is a timestamp). Based on the known positions of the satellites and the beacon signals it receives, the receiver figures out your position on Earth using math!

So either stop the explanation at "satellites and their positions" or give the full explanation. Not this regurgitation.


Except that it has nothing to do with how GPS works. The satellites essentially broadcast what time it is, and your receiver uses the delay between broadcast and reception to calculate where/when it is.

Explain like I'm five means break it down, not lie about the complicated bits.
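
And the complicated bit being glossed over is tiny; the core arithmetic is just distance = speed of light * travel time. A toy sketch, with made-up numbers:

    // Toy pseudorange: distance = speed of light * signal travel time.
    // A real receiver does this for four or more satellites at once and solves
    // for position plus its own clock error; the travel time here is made up.
    double c = 299_792_458.0;              // speed of light in m/s
    double travelTimeSeconds = 0.0674;     // ~67 ms, roughly right for a ~20,200 km orbit
    double distanceMeters = c * travelTimeSeconds;
    System.out.printf("That satellite is about %.0f km away%n", distanceMeters / 1000.0);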


>Not bad!

It's terrible. This is totally not how GPS works; i.e., the terminal never sends a message to the satellite. This is the danger of early LLMs: a large amount of their output is false and there is no way to tell unless you are already an expert.

This makes them basically useless except as a toy.


Asking it to “explain how GPS works like I’m a seasoned engineer” yields a more detailed explanation and there’s no mention of GPS terminals sending messages back to the satellites.

Probably the “false output” you mentioned is due to the fact that the prompt asked to “explain it like I’m five”, which made ChatGPT answer with a “dialogue between terminal and satellite” explanation, which (arguably) may be better understood by a five-year-old.


Did ChatGPT come up with this, or just rip it off from some place(s) without attribution?


I've seen this sort of response a lot lately, both for GPT and all the image generators.

Google, and all the other search engines, literally show snippets of the original content to display search results, and have had copyright infringement lawsuits over this.

Yet somehow it's taken these AIs, which generally aren't just block-quoting things unless you ask them to (and which often get it wrong when you do), for non-lawyers to start asking that question.


I think that just indicates it's more blatant. With search engine results I can click through to the original source. That's attribution, at least to the laymen. ChatGPT has no such thing.


I selected a few sentences from the given response, asked DuckDuckGo to find them, and got zero hits for any of them.

GPT definitely has some kind of internal model of things and how they relate to other things, and the capacity to express those relations in different ways.

Whether this is functionally the same as the model in my head, or some analog to it, or if such descriptions are merely cute but misleading anthropomorphisation, is far beyond my knowledge of neuroscience, psychology, and indeed machine learning.


> I selected a few sentences from the given response, asked DuckDuckGo to find them, and got zero hits for any of them.

Science.


Is it any different than an infinitely capable human reading the entire internet and using the knowledge it gained to answer questions?


Google or GPT?

Either way I don't know. Might be the same, but that might be an unwarranted anthropomorphisation.


I asked it to explain it to me like I'm Shrek.

Imagine that the GPS satellites orbiting the Earth are like the magic lanterns in the sky, sending signals down to a receiver (like the one in your phone or car). The receiver can use those signals to figure out how far away it is from each of the magic lanterns. By measuring the distance to multiple lanterns, the receiver can triangulate its position and figure out exactly where it is on the surface of the Earth. So even if you're a big, green ogre like Shrek, you can use GPS to find your way around and never get lost!


And it's only going to get better over time.

As I mentioned elsewhere, conversational AIs in the early 2020's are like PCs in the 1980's -- a new technology that still has lots of early limitations.


I'd be more impressed if it explained set-theoretic forcing on that level.


That is, if you can blindly trust the answers... If it could provide the top sources for the information so I could fact-check it when needed, it would work.


That's why, in some respects, I think the sheer clunkiness of "according to website.com, Claude Shannon is..." or "OK, I've found this on the web..." in the current consumer tech is better than anonymous, authoritative, somewhat novel answers that might be the internet's consensus opinion or might just be a vaguely relevant word vector structured like an answer.


We're going to have a generation of people who are cocksure about things because ... AI, but they won't actually know jack shit. It sort of happened with Google: people can figure something out, but they don't systematically get things. Shallow explanations. Goldfish attention spans.


I think that assumes AI makes any difference to the self-certainty of humans.

My mother was into all kinds of new age and supernatural stuff. You may think you can see where this anecdote is leading, and oh boy I could go there, but instead prepare for a twist! One of the stories she had was a client saying "It is not possible, Susan, it is not possible that there are beings out there that are cleverer than what we is".


Have you had much to do with young graduates lately? We already have that generation without AI being the cause.


Is there a way to dissuade users from blindly trusting responses? Such as unavoidable caveats, or making no attempt to provide depth and instead strongly recommending reliable research?

And as a logical aside: if AI can't be trusted, we shouldn't trust AI's recommendations for reliable sources either.


> And as a logical aside: if AI can't be trusted, we shouldn't trust AI's recommendations for reliable sources either.

Don't think that's automatically true though: an AI can be consistently excellent at retrieving reliable sources and still get a lot wrong on its summaries. Also humans have their own opinions on sources even when they're not familiar with the detail, and finding out if the AI reads Nature or 4chan is part of the debugging process...


Sure. Really, I was thinking of the scenario where someone explicitly asks for a reliable source, i.e. "Thanks for the summary about X. To whom should I turn for more accurate/detailed information?"


Oh yeah, it's spectacularly bad at some of that sort of thing. Ask for a list of academic papers and it'll make half of them up and give others new authors...


Agree. ChatGPT is how I imagined the internet would be back in the 90s (without having the foggiest clue how it might get there). Your questions are similar to the types of searches that people really want but have to translate to 'GPS simple explanation' or 'COVID in <town> this year' while hoping for a set of relevant pages in the Google return.

I've gotten it to fail on some word problems, but even then it was impressive to see it try to come up with a solution. More humorous were requests like 'Create a post for ycombinator about ChatGPT in the style of Chris Farley'. It's surprising and quite welcome that it even attempts that.


Can LLMs be updated quickly the way search engines are by crawlers and what not? ChatGPT is a couple years out of date and there was a lot of human involvement. Doesn’t sound like a quick or automated process. GPT-4 is still under development.


Another way to think about these kinds of requests in a future AI world is to ask yourself why you're asking these questions? My take:

"Computer, can you write and deliver a message to my kid that explains how GPS works?"

"Computer, can you tell me what risks there are at dining at McDonalds down the street?"

"Computer, can you read this paper [link] and tell me why it's referencing Claude Shannon?"

"Computer, I need to waste some time - please use HN to help me"

"Computer, tell me about any plans that I have for today"


Like Ask Jeeves on steroids.


So, what you're saying is that "I'm feeling lucky"?


where will this data come from and how do you make money off it?


By inserting "enjoy an ice cold refreshing cocktail cola" in the middle of the output?


>"Computer, please find out how GPS works and explain it to me like I'm five years old."

”I’m sorry, I cannot answer that query because it has ableist connotations. I am a large language model by OpenAI.”


I guess being young is a disability.


Au contraire, ChatGPT has already replaced search engines for me, at least for generalized knowledge searches. It provides better and more personalized answers to my questions than Google and is an excellent way to learn about new topics. It's not 100% accurate and can even be misleading, but that is hardly different than other search engines. Once more people catch on to how incredibly useful ChatGPT is its use will become ubiquitous (and we've barely even scratched the surface of what these tools can do)


ChatGPT is miles above Google search for me when it comes to learning the "hello world" of just about anything at this point.

From there it's a simple prompt of where to go next and asking for references to work that dig deeper.

Even if it's wrong, it almost doesn't matter, because I can just say "are you sure?", give it the extra information, and it corrects itself. Even if it's still wrong, you can prompt it in ways that give you potential search terms to put into Google.


One of the challenges with ChatGPT compared to Google is that most employers, even the really paranoid ones, don’t mind you using a search engine to answer questions. That goes out the window for ChatGPT: My current employer won’t even let me use translate.google.com, let alone copy/paste some IP into someone else’s LLM.


I fully expect enterprise versions of ChatGPT to become widespread in the near-term. Modern tech firms already give their IP to dozens of third party tools and services, I don’t see why this should be any different


Also, in a further iteration, people could upvote and downvote results, or even parts of them, so that it learns what it knows and what it doesn't. If it just said "I don't know" for bad results, or omitted the bad parts, it would already improve things quite a bit. Doing so is quite simple, really.


Bing announced their ChatGPT deal the next day: https://www.theregister.com/2023/01/04/microsoft_openai_chat...


They haven't "announced" anything. It would make cost-per-query and latency go through the roof, so it's HIGHLY unlikely.


Making a GPT query on every search? I'd say that's indeed unlikely. Luckily, most queries are repeated, so caching can work very well.

There are many opportunities there, gpt could potentially be used for common queries to expand the results and even as a way to disambiguate queries. For instance, if I ask it:

"If I make a search query for "go", what are the possible different things I may be looking for?"

I get

> There are many possible things that someone might be looking for when they make a search query for "go". Some possible interpretations of the term "go" include:

>

> The board game "Go"

> The programming language "Go"

> The command "go" or "Go!" as a signal to start or proceed with something

> The verb "go", as in to move or travel from one place to another

> The website "GO", which is a popular search engine

>

> It's also possible that the person making the search query is simply looking for information about the word "go" itself, such as its definition, pronunciation, or usage in different contexts.

there may be better prompts, of course.

Also, they may identify some queries as being GPT-friendly and route those through GPT, possibly augmented with a suitable prompt. The thing is, giving the query as-is to the GPT model is not the only option. They can certainly be creative with how they ask GPT and interpret the results. They don't necessarily even have to display the GPT response; they can use it to improve the results.
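
A minimal sketch of the caching idea, assuming an askGpt placeholder for whatever completion API gets called and an arbitrary cache size:

    import java.util.LinkedHashMap;
    import java.util.Map;

    // Sketch: cache model answers for repeated queries so only novel queries pay
    // the per-prompt cost. askGpt is a stand-in for a real completion call, and
    // the cache size is an arbitrary assumption.
    class CachedQueryExpander {
        private static final int MAX_ENTRIES = 10_000;

        // Access-ordered LinkedHashMap gives a simple LRU: the least recently
        // used entry is evicted once the cache grows past MAX_ENTRIES.
        private final Map<String, String> cache =
            new LinkedHashMap<String, String>(16, 0.75f, true) {
                @Override
                protected boolean removeEldestEntry(Map.Entry<String, String> eldest) {
                    return size() > MAX_ENTRIES;
                }
            };

        String disambiguate(String query) {
            String key = query.trim().toLowerCase();
            String cached = cache.get(key);
            if (cached != null) return cached;          // cache hit: no model call
            String answer = askGpt("If I make a search query for \"" + key
                    + "\", what are the possible different things I may be looking for?");
            cache.put(key, answer);                     // may evict the oldest entry
            return answer;
        }

        private String askGpt(String prompt) {
            // Placeholder: wire up whatever completion API you're using here.
            throw new UnsupportedOperationException("not implemented in this sketch");
        }
    }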


Exactly. I keep reading all this hype and folks don't realize how much compute is spent on each ChatGPT query (I'm sure it'll be optimized over time). There's no free lunch


They announced it to The Information, anonymously, in a leak.


allegedly


Thanks. This will help if Microsoft asks it who to sue.


MS would run their own instance. Can you imagine how much money they're willing to spend to dethrone Google?


Why would it make latency go up? It's incredibly parallelizable: just copy the weights onto more machines.


>It's incredibly parallelizable

No, it isn't. It generates 1 token at a time in a loop until the response is finished. It's a highly serialized task. Parallelization increases the throughput of how many queries can be processed simultaneously, but you wouldn't be able to speed up a single query.
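
To make that concrete, the decoding loop is roughly this shape (nextToken stands in for the model and END_OF_TEXT is an arbitrary sentinel; neither is a real API):

    import java.util.ArrayList;
    import java.util.List;
    import java.util.function.ToIntFunction;

    // Each generated token requires a full forward pass that depends on every
    // token produced so far, so a single reply is inherently serial. More
    // machines raise throughput across requests, not the speed of one request.
    class SerialDecoding {
        static final int END_OF_TEXT = -1;  // arbitrary stop sentinel for this sketch

        static List<Integer> decode(List<Integer> prompt,
                                    ToIntFunction<List<Integer>> nextToken,
                                    int maxTokens) {
            List<Integer> tokens = new ArrayList<>(prompt);
            while (tokens.size() < maxTokens) {
                int next = nextToken.applyAsInt(tokens); // one forward pass per token
                if (next == END_OF_TEXT) break;
                tokens.add(next);                        // the next pass needs this token too
            }
            return tokens;
        }
    }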


Ah the ol’ have a baby in a month by impregnating 9 women strategy.


The additional CPU-to-GPU-to-CPU round-trip time, I'm guessing?


I really wonder how the economics of running ChatGPT for Bing's requests will work. GPT is anything but cheap to run; IIRC, it requires more than four A100 GPUs.


The bill is probably already largely footed by MS. OpenAI and MS have a partnership, and it runs on Azure.


Of course - it makes sense to run ChatGPT for free now (remember, it's an experiment) for training and beta testing. But running it at an even larger scale, on a search engine? That's an order of magnitude of difference IMO, and totally different from a short-term bill for an experiment.


They could go after the really expensive keywords and do stuff there. I'm just not sure how they'd make more money from that stuff.


Who gives a shit how much it costs. The value is massive and 10x better than Google results. Are you comprehending what that means? After competing with Google for over 10 years with nothing to show for it, Microsoft is drooling over this technology.


There’s a very old story about Microsoft and Win 3.x pen computing. One of the leads was explaining that they worked on it, and someone else opined that it was too bad that they didn’t provide anything useful for MS. The lead replied that the goal wasn’t actually pen computing, it was a special teams “block that kick” play that kneecapped a competitor and so achieved its goal.

If MS can reduce the absolute firehose of money that ads provide to Google, it will totally be worth it even if it takes lighting money on fire to do it.


I asked ChatGPT to emulate various OSes, including Linux and DOS. In a Linux emulation, I had it curl whatsmyip.com. This bit was in the response:

    <span id="ipv4">35.230.98.61</span>
That's a Google property. I wonder if it was trained on Google's cloud.


Or, more likely, it made up a plausible IP address.


The curls have returned two distinct IPs. Both are google's. How probable is that for a plausible guess? Run the experiment yourself. There was an article a while back that shows the prompt preamble to get you going. See what you get.


ChatGPT does not actually execute commands. The IP is in some text dataset and it returned that.


“ .. trained on .. “ ^

["Overall, the specific details of the dataset used to train GPT-3 are not publicly disclosed, so it is not possible to say for certain whether interactive sessions by trainers were used in its creation." - the horse's mouth]


As I just said elsewhere, it's not going to "run curl commands" at training time. It's trained on a corpus of public data; it's not running commands at all.


Noticed this as well. Just like you can ask it to show `uname -a` and the like. The results tend to vary with the sampling temperature and just seem to be invented, realistic-looking bits of data modeled on what it saw in training (e.g., Stack Overflow?).

The training technique doesn't require internet access either; it's not going to give you the IP it used during training.

Very funny, though.


True, but it does not have to. ChatGPT is its own thing. It does things search engines can never accomplish. You can ask the same question the other way around: "Why won't search engines replace ChatGPT?" The essence of what ChatGPT is is here to stay. We'll be seeing different flavors, and it will be a big part of everyone's life.


Googler here; opinions are my own. (I know nothing about search.)

Google has LaMDA, which I could see them putting the answers from it into the knowledge box at the top of searches. There is no reason they have to be mutually exclusive, a search engine can provide answers from both. Google search is already an aggregation of multiple sources (images, web index, knowledge, shopping, video, flights, etc...). Adding another source seems like the obvious path forward (assuming accuracy and cost make sense).


Very true, they already have different sections in the results pages. There's no reason why they can't have one more. They can also use the box as a way to improve accuracy by having users vote on the quality of the result.


If we don't need to click a link, does Google receive less revenue? I think including more direct answers to a search query would require changes to Google's business model.


The ads would still be clickable. Clicking the actual results doesn't really help google much.


Except for any ads embedded in the referred sites.


The main argument of this article is:

1. ChatGPT gets tripped up in some edge conditions.

2. Search engines provide some interesting features that ChatGPT doesn't.

These are both bad assumptions looking forward:

1. It's evident that ChatGPT or other chat-based text generation will only improve over time. These kinds of quirky edge cases will _always_ exist, but will be harder and harder to contrive.

2. These sorts of features, categorization, images, etc can be discovered via conversation. And there's no reason to think these features are exclusive to a search product. Code snippets embeds are already built into ChatGPT, it's not hard to imagine other useful embeds.

Finally, I'd ask the author to think of it this way:

As more text is generated, and those pages end up at the top of search results, the quality of the search results approaches that of a text generation model anyway. Search will soon be the unnecessary middleman.


> ChatGPT gets tripped up in some edge conditions.

No. ChatGPT has no idea of the truth value of anything it produces. That's fine for writing fiction, but it's not an "edge condition" when searching for actual information.


I am curious whether a search engine is really any different. It has little idea of the truth value of what it produces, other than prioritizing known and well-referenced resources.


I'd love for you to explain how a search engine does have any idea of truth.


It refers to real web pages, so it's not making up results the way that ChatGPT does. The value of those web pages is assessed using an algorithm like PageRank.


ChatGPT hallucinates facts fairly consistently if the prompt is not worded correctly. IMO the prime benefit of search engines over LLMs is the ability to provide validated information and opinions that are directly attributed to experts. This is unlikely to be achievable in a purely differentiable model, so LLMs are useful but will probably be best used in combination with standard search engines.


Let's check back on this in a few years. The idea that a search engine can cite experts but a LLM cannot sounds wrong to me.


If that's how you feel then maybe you need to re-evaluate what an LLM does.


Turns out, we only needed to wait a couple of days: https://neeva.com/blog/introducing-neevaai


I hate to break it to you, but that was posted 8 days ago, 3 days before my comment.

It existing doesn't make it a good idea.

I stand by what I said.


I think the LLMs are better if you want knowledge synthesized, assuming it can be done correctly (it is very confidently wrong a lot of the time). But if you’re trying to find something specific, often you want the complete work published by a human as is.


Do you have any reason to think LLM cannot reference published work?


Yes - you’re in an embedded space that’s the result of effectively compressing knowledge into something higher order. LLMs are not grounded, so to ask it to ground something would not be reasonable. It’s not a lookup aka search engine where references are maintained.


As soon as it needs to bring in money, it won't improve anymore.


Maybe. But that doesn't mean it won't be a lot better than current-gen search by then.


I used (and still use) Wolfram Alpha for the longest time. Back in the day, I was amazed that it could calculate a bunch of things that were way out of range for Google, etc. - and I always envisioned future search engines as a sort of hybrid: you'd get linked to the correct websites, but you could also ask them for whatever you want.

I don't need ChatGPT or similar models to replace search engines, I just want to use them in parallel.

I'm excited to see what Google does with their LaMDA model, now that Bing will incorporate ChatGPT.


In the days of the Romans, tech writers carefully explained all the reasons Hannibal would never get elephants over the mountains…

More seriously, Google seems capable of learning from and acquiring similar technology. In that sense, I think their market position will remain. But ChatGPT is a threat to Google in the sense that it’s already drawing search volume away for some searches for some influential people.

ChatGPT is much worse at some things than Google - timely relevance and over-confidence come to mind. But it is astonishingly better at a surprising number of search queries. Given the rate of improvement and the ability for young developers to plug into the ecosystem and make unexpected new stuff, Google has plenty to be concerned about.


One thing the article misses is that users don't use search engines/Google/etc. just to search. We use them to solve a problem we have: find the nearest X, the cheapest Y, the answer to Z. A few of these can be done with the localized data that ChatGPT has. Of course, scaling it to search-engine levels is an insanely hard challenge. But it doesn't need to go all the way to do that. Selectively targeting certain portions of what we use search for can easily cover a lot of the daily queries that you google. Imagine if ChatGPT didn't need to crawl, but you as a service/business had to integrate with it; that could evolve into something different from search engines and yet eat search engines.


Sometimes I just want to know something, not search for something. Up until now I've used Google for both, but lately I've just asked ChatGPT about things I want to know, as if it were a conversation partner. It still lacks knowledge and one must distrust the information, but this is the beginning of something very big, like a new age.

I enjoy being able to ask questions without the other party feeling attacked because of their lack of knowledge or my corrections, and to just focus on the topic, even if it starts getting weird, like it telling me about "clockwork energy" as if it were a thing that existed, without being able to name any scientists who have dealt with the topic (it started with my question "could time be a form of energy?" after we were talking about the different forms of energy that exist. It claimed that in thermodynamics there exists a concept of "clockwork energy", which isn't mainstream, but it wasn't able to offer me additional information about it). It's fun, 100%.


It's doing what the voice assistants were supposed to: solve problems by assisting you with more context in a conversational manner.

Speech-to-text queries solve the problem of my hands being broken. ChatGPT solves the problem of my brain being contextually broken: I need more context, not marginally more relevant search results at the bottom of the screen, after the ads, that I then slog through only to wind up changing my query and searching again, like some sadistic, freemium, ad-driven Sisyphean task that flies in the face of innovation to help me reach a contextual nugget.


One note, though: you can't tell whether its statement is true or made up.


The problem with chatgpt is that I can't evaluate the source it used to learn the information to see if I think it's credible, and chatgpt can't give me any indication of whether it's credible or not, and its answer will be just as authoritative in tone if it's completely incorrect as if it's correct.

I asked it for a schematic to connect an I2C device to a microcontroller; it told me to directly connect the device to the microcontroller. I asked it if pull-up resistors were required; it said yes, and then drew what it called "pull-up resistors" (describing them correctly) incorrectly in series between the bus and the device, rather than between the bus lines and Vcc.

I know that's wrong, but someone who doesn't really know what a pull-up resistor does or why they are there would be at least somewhat confused by that interchange, and would be misled into doing it wrong. Meanwhile, I can type in "i2c schematic" to google and get thousands of correct schematics back to look at.


They might not replace them, but ChatGPT fulfills a role that Google used to fill.

Try to find out what type of spring is in a spring clamp using Google: "What type of spring is in a spring clamp?" Or use whatever query you want. The top 10 results won't tell you the answer.

Ask ChatGPT. It'll tell you (spoiler: it's a tension spring or a torsion spring. It's not a coil spring.)

8 out of 10 Google results are trying to sell me spring clamps. Seriously Google, do you not see that your search engine is FUBAR?


Hmmm, ChatGPT doesn't seem consistent:

Q: what type of spring is in a spring clamp?

A: Spring clamps are typically made with a type of spring called a helical spring or a coil spring. These springs are made of wire that is wound in a spiral shape and are used to store energy and apply a force when they are compressed or stretched. In a spring clamp, the helical spring is used to provide the clamping force that holds the clamp's jaws together. When the jaws of the clamp are opened, the spring is stretched, and when the jaws are closed, the spring is compressed, creating the clamping force. Spring clamps are commonly used in woodworking, metalworking, and other applications where a temporary hold is needed.


And even after reading through the first post and your post, I had to do a double take to figure out it was wrong.

This is exactly why I don't bother with copilot - I tend to skim when something looks right.

Same thing when I'm reviewing code - I don't go into details - I just do high level sanity checks, if I'm familiar with domain I check for problems I anticipate, look for potential improvements where my experience matters.

If I have to go in depth I might as well do the thing on my own.


Google says "tempered spring" in bold. Doesn't sound like the answer you're looking for, but it was very clear.

The majority of the search results are for places to buy a spring clamp. This is because the overwhelming majority of people who search for that term are looking to buy them, not to learn about what they are. I assume this is true for most specialized parts and equipment.

The majority of humans are more interested in accomplishing a task than they are in any form of learning. So that's what Google optimizes for. They have billions of users and most of them aren't looking to learn about how anything works on a given day. They just need to buy a clamp, or whatever.


> Ask ChatGPT. It'll tell you

Without any way for you to know if it's the right answer or not. Wow, so useful.


How is this any different from Google? The only thing Google excels at is serving you SEO garbage on every query.


Google results (plural!) give you alternatives and references, i.e. you know where each answer is coming from, and SEO spam is easily identifiable.

ChatGPT just gives you an answer that's confidently wrong sometimes.


But isn't that just a UI problem at heart? It would be easy to ask for references for its conclusion.


ChatGPT doesn't have references and doesn't make conclusions, it's a statistical model.

Ask it for references, it will invent you URLs that look legit but don't exist.


Well, I then did a Google image search (which is much better than its traditional search) and it came up with things that did look like the one I had in my hand.

But I do get where you're coming from, ChatGPT is way too confident when it's wrong!


That's one good thing about search: you get multiple results and different opinions.


I bet google could deliver the answer but they need to serve ads.

The same would happen with ChatGPT. As soon as they need to make money without charging users directly, its answers will change.


Curious, the second result for me was aboutmechanics.com, which said it was a coil spring. But I agree Google seems to be interpreting the question as "what's a spring clamp".


Interesting. aboutmechanics.com is not in my top 10 results. But even aboutmechanics is wrong; it's not a coil spring, it's a torsion spring.


My understanding is spring clamps typically use a helical spring, which is a compression spring coiled into a cylindrical shape. Other types of springs, such as leaf springs and torsion springs, may also be used in some spring clamps, depending on the specific design and intended use of the clamp.

In your first post you also equated tension and torsion springs, but they’re very different from each other. The former stores and releases linear energy while the latter stores and releases rotational energy.


Either way, I don't know that I see it as an example of ChatGPT "replacing" a search engine: search engines never really claimed to be able to provide precise answers to detailed technical queries; they just do the best they can to find webpages that might contain information relating to your query. But I suspect it won't be long at all before Google/Bing et al. will be able to give you answers to such questions at least as good as ChatGPT's where up-to-date/live info isn't involved. At some point they'll stop calling themselves "search engines" and just be a single entry point for any query you might have, regardless of where the information comes from.


But chat is not a search engine, so it will not replace them; it will make them obsolete. I don't see any of these obstacles as substantial. Conversational systems will take us back to the original, "Blade Runner" concept of technology as something that gives us answers, not something that gives us the books to find the answers.

The main hurdle for OpenAI is that it must find a way to link back to the web so that people have an incentive to keep creating content for its training sets. But it seems this could be fixed if the model provides direct links to, e.g., buy stuff in its answers.


> not something that gives us the books to find the answers

And what if you want more than the generic summary answer? You could keep asking questions, but at some point, you might not know what to ask or what the alternative answers might be, particularly on a controversial topic, or one where the answer isn't fully known. And there could be lots of reasons you might want to visit a site over just getting an answer. If search is replaced, something else needs to link to those sites. Maybe I'd rather watch a video, look at videos, or find a discussion.


Good point, and that's why I've recently come to see search engines as returning a kind of "table" for us (just not in tabular form). The list of links is the rows, and for every link we have properties ("columns"): the title of the page, the date, the snippet with phrases containing our terms. The same is true for stackexchange.com/stackoverflow.com pages: the answers are rows, and the text, score, and comments are properties of each row. We are used to scanning these "tables" either to find the best answer or to explore multiple variants of answers. A single answer for everything is too limiting.


Yeah, I agree. I do think Microsoft is capable of messing it up and blowing their first-mover advantage. They have messed a whole lot of things up over the years. But they haven't messed up GitHub, VS Code, TypeScript, and several other things (IMO anyway).

It'll be interesting to see how hard they try to stick to the Bing vision versus what they should do, which is replace it with something conversational.


It already has replaced Google for some of my needs. If I want a simple answer to queries like:

How to make a Sazerac

What’s a good waffle recipe

How do I change the O2 sensor on a 1982 911SC

I find ChatGPT issues perfectly good answers without all of the superfluous results, words, clickbait, and ads.

And for many other questions that are technical, medical, or trivial in nature I prefer its simple one-shot response.

Is it perfect? No. But neither is Google. I still at times need to use quotes and booleans to get Google to bring me the results I want.

As a side note, Google is the Microsoft of 1997. Dominant and content. It doesn’t really want change. That’s why it’s time for a new challenger.


Perhaps we'll see OpenAI grow to be the Google of 2022?


With maybe one exception, search engines are already replacing themselves. They're "answer" engines now, but that's really a pretense for stealing content and telling the user what their interests are. While that might line someone's pockets, turns out that's not really useful for much other than the casual normie trivia question.

ChatGPT is an actual question engine. Even if ChatGPT doesn't live on in its current form, the institution of ChatGPT is the future of asking the internet a question and getting an answer. That and many other things, including (non)human relationships.

I hope it succeeds in making The Google's questionable value proposition all that much more clear. Yet if it goes down the road that The Google went, things can become much worse. I can't imagine governments are going to just sit back and relax while the next generation of manufacturing consent is manifesting before them.


Relatively soon, someone will figure out how to automate having ChatGPT stand up barely functional (over time) servers. Then they will be able to ask ChatGPT to create websites about X, Y, or Z. At that point, knowledge and our ability to trust the internet, which was already on shaky ground, will completely evaporate.


I wrote in the input: What is the name of that Nintendo game with animals and they live their life and do stuff?

ChatGPT: It sounds like you're thinking of the game "Animal Crossing." In Animal Crossing, you play as a character who lives in a village inhabited by anthropomorphic animals, and you can do a variety of activities such as fishing, bug catching, and decorating your home. Is this the game you were thinking of?

Google: Animal Crossing is a social simulation game, dubbed a "communication game" by Nintendo. It is open-ended, and the player's character can live a separate life with very little set plot or mandatory tasks. Animal Crossing (video game) - Wikipedia

ChatGPT-like question analysis is already integrated into the Google search engine, for better and for worse.


Google is just a lightning-fast but messed-up index combined with really good graphs. ChatGPT is only one tool for now. Soon it will be just one piece in a box filled with a combination of tools that let you know whatever you want, laid out in whatever depth of detail you specify:

- large conversational language model that generates the piece of information

- fact-checking model that applies post-hoc checks on logic and consistency

- citation graph model that backs the propositions with sources and ratings

- geospatial and locational model that adds references in space

- recognition-development model that can classify how realizations and knowledge develop over time

- social trend model that can adjust to conversations of the society as a whole

All this in combination will bring any knowledge in any level of detail with a defined level of certainty.

Do you want to know more?


I would like to know more about the fast fact checking model. This will require some kind of symbolic NLP approach that doesn’t exist yet.


Here is a good recent overview of where automated fact checking is positioned right now: https://aclanthology.org/2022.tacl-1.11.pdf Fact checking of AI generated content can potentially be optimized for this use case. And for speed: I think many of the computations for several layers of AI processing can be decentralized in the future and re-establish the heart of the Internet. The contributions for such computations need a high level of trust and therefore a public immutable ledger might be a viable approach. So perhaps decentralized AI might be the use case where finally some "blockchain-crypto-cringe" might actually be useful ;-)

We need to be aware that we just entered the steep part of the exponential growth graph. Amazing times ahead.


I'm blown away by how many tech-savvy people on Hacker News still persist with Google Search or talk about it like it's the only choice when there are better and/or equally good, less evil alternatives.

I say this as I would expect more use of alternatives here of all places.


I've been using ChatGPT to suggest me books similar to books I've liked and so far the results have been good. Many of the books that are recommended are books I've read and enjoyed so the remaining ones on the list are added to my reading list.


For me it suggested fictional book names that do not exist


Google is abhorrent for certain searches:

Google result:

https://i.imgur.com/uaFGs2D.png

ChatGPT:

https://i.imgur.com/On6Ol1D.png


My biggest worry is that search engines will combine ChatGPT-like functionality with the existing ads-based business model. Combining ChatGPT's persuasive/confident responses with brand bias feels socially destructive.


I've found that if I give it a pair of questions ("Why does X breed faster than Y", "Why does Y breed faster than X") it'll give me two neat little responses that explain why both these statements are true.

That doesn't sound useful on the surface of things, but the additional information in the responses allowed me to craft more traditional search queries I wouldn't have thought of (in my example I learned that species X is hermaphroditic, and that water temperature may be having an impact).

So replace? No. Enhance, if used carefully? Maybe, for now.


These feel like really strange criticisms, since they're just so straightforward to resolve.

Can't figure out typos? That's a solved problem. Doesn't understand all questions correctly? It's going to get much better at that very, very quickly. Chat results don't give context? You can put AI search results in whatever UI you want.

Not shocking to hear a search company say AI won't disrupt search, but if this is the best they can do, I feel pretty confident saying that AI will disrupt search very soon.


When ChatGPT first arrived, it felt amazing and revolutionary. Since then it's been so neutered that, while still feeling pretty cool, it's settling into gimmick level :(


It won’t have to replace; it can just augment and be very successful at it. I’m just unsure how to cover the massive costs of running this thing with millions of req/sec.


In my opinion, ChatGPT is a big threat for close-ended questions: What is a car?

What I am not so sure about is whether this is really the majority of searches. I assume a big part of searching is about discovery — finding articles, products, images, etc.

For that, you need a good ranking and efficient crawling. Two things ChatGPT doesn't have.

Given the strong brand of Google, only time will tell if ChatGPT will eat a large chunk of Google's market share.


Personally, I believe the future of search is video. Most of the world uses mobile devices, and video is better on mobile devices than text.

This is why Google and TikTok are shifting to short-form video for search.

At my company, we're seeing significant search traffic through YT shorts, currently ~2500 views per day: https://www.youtube.com/@wyndly/shorts


You're seeing shorts on YT because Google has changed their ranking algorithm.

It seems that the quality of shorts has really declined lately. It's almost as if they are being made with the assumption that the only audience they will have is comprised of members of the gen-z generation, who are known for having short attention spans and being easily captivated by quick, sensational videos. These types of videos often feature someone doing something seemingly ordinary, like crossing the street, but with a dramatic caption promising that the viewer won't be able to believe what happens next if they continue watching. It's truly sad to see the decline in quality of these shorts, and the reliance on cheap tricks to grab and hold onto the viewer's attention.


This is interesting. Garbage in = Garbage out.

I'm using ChatGPT for some repetitive things, but I have to double-check the answer every time just to be sure. There's also the reality that you need to know exactly what you want and tell that to ChatGPT, then analyze the answer, try to incorporate it into your flow and, in some environments, double-check with legal just in case.

So, yeah. It won't replace searching nor us ATM.


This isn't a binary (yes/no) answer.

ChatGPT already replaced search engines for some (about 3-8%) of my questions.

Search engines really suck at answering complicated questions that use the same word in >1 context. For example ChatGPT answers this with ease (whereas google doesn't get close):

> Are there any currencies which have two or more units of currency and whose major unit is comprised of more than one hundred subunits?


Google already provides a ChatGPT-like experience with the questions it displays based on the query. It is unlike ChatGPT in that Google comes up with those questions, but they are usually spot on, and you can see the source the answers come from. Perhaps all Google has to do is provide a ChatGPT-like interface to that subsection of search results so it feels more interactive, like ChatGPT.


I am absolutely sure that if ChatGPT created a Google-like interface on a domain like askme.com, without a required login, it would become immensely popular, possibly eating away a lot of traffic from traditional search engines. Google Search is becoming the new Yahoo: it's becoming bloated, if it isn't already. It's just time for a new era, and a ChatGPT-like product fits it perfectly.


Well, you.com already built this so we just need to sit back and watch https://you.com/search?q=who+are+you&tbm=youchat


Bing supposedly will be doing this in a few months.


Every time I read things like this, I take the opposite to be the most likely outcome.


This feels like what a company that hasn't accepted the future would say. Yes, ChatGPT doesn't do exactly what Google does. Can it be augmented by well understood search retrieval engines to generate a much better response? I think for a lot of cases that seems to be undoubtedly true.


I'm (the author) actually in agreement with you. LLMs are going to be a big part of search in the future; I alluded to that in the post. I'm less convinced about search as a chat interface. But LLMs for query understanding, ranking, etc.? Of course.


How do you see LLMs handling the constant flow of new data, given spam, SEO hacking, fake news and what not? Can the training be reliably automated and performed that quickly? It seems the chatGPT folks are allowing free access so they can test and fix abuses with human intervention. But that's hard to do in real time.


ChatGPT maybe, but OpenAI hasn't even tried to train a model to replace search.

Until somebody tries to fine-tune a model using RLHF explicitly with the goal of replacing Google, it's very hard to know what the resulting experience would look like. It could be shocking if ChatGPT is any guide.


I’m a bit surprised that a product manager for a search team would put forward arguments like this. ChatGPT isn’t a search engine, but that doesn’t mean it won’t be a significant part of a search engine product.

Query correction is a common component in search engines. “Did you mean” is everywhere. It is just one component in a search pipeline.

The stupid simple integration I’m sure Google will release is to put ChatGPT on top of the search results rather than the query itself. The question at that point is the economics of it: how much does it cost per query, and how much extra value can Google derive (or avoid lost value i.e. lost customers).

This is an advantage that Google has over upstarts like Kagi and DDG: Google can keep their cost per query much lower since they don’t buy their results from other sources.


Algolia's main product is search-related. You can tell from this article that they are scared of ChatGPT even partially replacing search. They are very scared of losing business, and anyone in that position will be writing articles like this, badmouthing ChatGPT.


I remember when Google first arrived on the scene -- I had been using AltaVista before then, and Google's search was like a miracle. But Google these days just feels like AltaVista in 1999, except that the haystack is a lot bigger now and more of it is spam.


But will it replace search engine “frontends”? You -> ChatGPT -> Search Engine. Any examples of ChatGPT querying data live? I’m no AI expert, but it feels like this would make a lot of sense; it seems like what current voice assistants are doing, right?


Bing is going to do the same.


Agreed that chat-based search won't replace traditional search any time soon. BUT even in this article, the author mentions that hybrid semantic search is actively being integrated into Algolia.

Both have pros & cons and the best solutions over the coming years will be hybrids.


"That will NEVER replace search engines!!" -Search engine vendor

"Those things will NEVER replace the horse and buggy!" -Buggy whip vendor

but seriously. Search engines are (1) a virtual monopoly for one company. (2) hopelessly conflicted due to being run ONLY in an ad-supported model, just because that early business model was never revisited. (3) prone to a neverending trench warfare against spam and black hat SEO.

Not that AI-based replacements for search might not end up with the same flaws, but for the above reasons, I'm rooting for whatever comes out of OpenAI to completely upend Google's current business model (whether or not GOOG ends up being smart enough to be one of the leaders in that space).


Just heard of this, so I logged in with my google account. The mistake it makes below is interesting. If this were a human response, I would assume a brain fart. Not expecting this type of error from "A.I."

Me> Find the area of an ellipse with semimajor axis 5 and semiminor axis 4.

ChatGPT> The area of an ellipse with semimajor axis 5 and semiminor axis 4 is 40π. You can find the area of an ellipse using the formula: A = πab, where A is the area of the ellipse, a is the length of the semimajor axis, and b is the length of the semiminor axis. In this case, the semimajor axis has length 5 and the semiminor axis has length 4, so plugging these values into the formula gives us: A = π(5)(4) = 40π.
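
(For reference, here's the same calculation worked out in a few lines of Python, just to make the slip explicit: the formula it quotes gives 20π, not 40π.)

    import math

    a, b = 5, 4                # semimajor and semiminor axes
    area = math.pi * a * b     # A = pi * a * b
    print(area)                # 62.83..., i.e. 20*pi, not 40*pi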


The trick is to get it to show its work first and then calculate the answer. If it guesses the wrong answer first and then writes out the steps, it won't correct itself.

This works: "Find the area of an ellipse with semimajor axis 5 and semiminor axis 4. Don't calculate the result until the end."


After computing your example, I tried my original question which now gives the correct answer. In fact, it gives essentially the same response for both forms of the question. I guess the training works as it should.


Being not very good with numbers is one of the limitations of the current model. I imagine that in the future we will see integrations with a number of different tools to overcome these limits - WolframAlpha being the first one that comes to mind when talking about algebra and calculus.


A friend told me students were using this for cheating on math exams, so I assumed (incorrectly) it had something like Wolfram Alpha built in.


Honestly, for many simple programming tasks/questions, I find it’s already rivaling Stack Overflow and Google for quick results. Just last night it flawlessly wrote me the scaffolding for a Bash script that parses specific command line flags.


I have experience with large NLP models and here are my humble thoughts:

In a large language model you can refine your original request via conversation. In a search engine you tend to rewrite your query to refine it.

LLM conversation is much more stateful than an almost stateless query to a search engine.

However a search engine shows you more than one answer.

With LLMs you can't easily add knowledge-graph-style facts (subject, predicate, object). It has either memorized a fact from the training set or it hasn't, and you can't easily insert one more fact in there. Retraining LLMs is not cheap, by the way.
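
To make that contrast concrete, here is a minimal toy sketch (purely an illustration, not any particular product): adding one (subject, predicate, object) triple to a plain knowledge store is a one-line insert, whereas getting an LLM to "know" the same fact means fine-tuning or retraining.

    from collections import defaultdict

    # Toy triple store: (subject, predicate) -> set of objects
    triples = defaultdict(set)

    def add_fact(subject, predicate, obj):
        triples[(subject, predicate)].add(obj)

    def query(subject, predicate):
        return triples.get((subject, predicate), set())

    add_fact("Paris", "is_capital_of", "France")   # instant, cheap
    print(query("Paris", "is_capital_of"))         # {'France'}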

An ideal search engine for me is conversational, up to date and one that gives you multiple answers.


This post reads like Nokia/Microsoft reacting to the first iPhone — myopic


and notably it is authored by a search engine vendor, lol


The article claims that ChatGPT is overly sensitive to queries, e.g. the -gate suffix and "come lad" vs. "cum laude". This seems like a weak argument based on a temporary shortcoming. Any NLP methods search engines use for word meanings should be trivially integrated into ChatGPT.

Comes off as a company that sells search solutions looking for reasons this tech won't replace it.

I'm sure the author is better informed, but what about the scalability of search? I'm not sure if the neural network approaches are as cost efficient (today) as some of our current search algorithms when it comes to answering queries.


Author here. I was surprised to see my post on here while scrolling!

My point about those examples is less that ChatGPT got it wrong, and more that it's impossible to really know when it does, because it seems so confident and you only get the one result. To be fair, this is a problem with webpages too, but there you at least get multiple choices.

> Comes off as a company that sells search solutions looking for reasons this tech won't replace it.

I can see why you'd say that, but that wasn't the goal. I originally wrote this as my own POV for my own blog. I really do think there's a UX problem in this (and, as I mentioned, I am a big fan of recent LLMs).

> [What] about the scalability of search?

You're right, the NNs aren't as performant as lexical search. They're getting a lot better though. (We're actually working on this at the moment.) LLMs, though, have a ways to go, so it's hard to use them right now for real-time search.


I'm sure ChatGPT has its uses for some people, but the few times I've tried to use it for tasks I would have used search for, it's been confidently, eloquently wrong. A search engine you can't trust, or at least evaluate the sources of, is completely useless. Recent publicity on LLMs has been incredibly successful at over-inflating the hype, largely because the technology fulfills the fantasy of being able to interact with technology in plain language and get plausible, seemingly coherent responses.


When they actually commercialize ChatGPT as a search engine, and when enough people get burned on nice-looking answers, it’s going to get relegated back to being a hobby/fun thing to use.


There are two things I have been wondering about ChatGPT:

1. What impact will it have on search engines? With ChatGPT, it will be easy to automate the creation of websites, which will be referenced by search engines, right? So maybe someday half of the results will just be generated websites that may or may not say complete nonsense in an authoritative way.

2. Will the successors of ChatGPT learn from websites generated by ChatGPT, too? What happens when the new model learns from outputs from previous iterations?


Information pollution. Training future models with the results of earlier models will result in a delusional feedback loop.

It will be interesting to see the solutions we come up with for this issue.


ChatGPT will make search engines useless, but not by replacing them. It will produce so many high-quality SEO spam websites that you won't find any real content anymore. It will raise the noise level on the internet so much that automatically finding a signal with search engines will no longer work. We will be back to manually curated lists of webpages, as in the good old days of Yahoo.


It seems like an article from Kodak or Nokia. It's only a matter of time, and it will be sooner rather than later, that multimodal LLMs will replace search engines. All of the problems mentioned in the article are being addressed as we speak.

Advanced retrieval transformer + RLHF + LAMBADA[1] = RIP old search engines.

1. https://arxiv.org/abs/2212.13894


how do you replace search engines? where is the data coming from? and why would anyone give it away for free? most of the web runs off ads


What's going to really slow their adoption down is that the creators of the raw information on which they are trained are going to need some way of being paid for their work.

Until payment is figured out it seems likely that creators will increasingly restrict access to their information and that will impact the utility and adoption of the models.

Google's probably got the advantage in terms of figuring that side of it out if it can overcome its inertia.


yea if this gets popular i can't see any incentive for someone to give the data away for free to the model


I give ChatGPT a question on how to do X in some programming environment. So far, same as googling. But then, OK, for me that solution gives errors, or a function doesn't exist; second question to ChatGPT, and bam, half the time it gives something that puts me on the right track. I can't tell Google that my Python/Postgres problem is special, so please do that example but with such-and-such restrictions.


Right, but the most powerful and useful AI would be ChatGPT like dialogue based system augmented with search engine. Not the other way around.


why the hell would anyone trade a bunch of factual sources, documentation, and actual expert opinion for an LLM that's consistently and confidently wrong for a significant portion of queries?

I can see this thing being useful as a summarizer. Even then I don't trust it to be correct and have to fact check every single thing that comes out of it against results I find in a search engine

You know what's even better than an LLM? Actual experts (bullshitters excluded). Having books written by people leading in their respective fields, professors, managers, people above you who have experience and know what the hell they're talking about. Why? Because they understand what you need to know in that context

I'm probably being too hard on chatgpt, I'm sure it will find a commercial use that's beneficial to society. I just hope search engine providers don't hop on this fad and raise a generation of people educated on word salad masquerading as authoritative information


For any chatbot I will ask only two questions: one about myself (since I placed an easter egg in software AOL took and used back in the day), and "what time is it?". ChatGPT, thankfully, may or may not be connected to the internet at the prompt, but that is usually a good giveaway. IMO Google blew it when they took away Google via SMS, this is the hole ChatGPT(x) can fill.


> IMO Google blew it when they took away Google via SMS, this is the hole ChatGPT(x) can fill.

This is curious. What is the use case for this?



Thanks for sharing that. I am wondering: are there situations today where you would have SMS but not web access? Or is there something else appealing about SMS search?


Sometimes I want deep discovery and filtered UI for product searches.

And other times, I want Star Trek-style omniscient chatty search: "computer, select a blue woolen sweater $50-100 that ships before Tuesday."

Both types of searches are necessary. ChatGPT is already one of my frequently-used search engines now, especially for conceptual overviews.


Have they not heard of it? The web is dead and as the poet said, giants will roam the earth once more.

Every time I interact with davinci-003, it makes me feel more and more like the first real paradigm change is coming. We are still using our computers and writing our programs in the same way as 70 years ago; I implore you to read the 'structured programming' debates and see how we argue about the same things now. Every business product basically still competes with a word processor and spreadsheet. We have written hundreds of thousands of lines of code so we can auto-focus the next field on a form, and it still doesn't work in 99% of cases. The GUI was not a paradigm change, it was more of the same, maybe even worse; mobile overpromised, and anything besides video is just a worse desktop.

Lists and tables.. for 70 years

This is the first real change.

--

"And what doeth the saint in the forest?" asked Zarathustra.

The saint answered: "I make hymns and sing them; and in making hymns I laugh and weep and mumble: thus do I praise God.

With singing, weeping, laughing, and mumbling do I praise the God who is my God. But what dost thou bring us as a gift?"

When Zarathustra had heard these words, he bowed to the saint and said: "What should I have to give thee! Let me rather hurry hence lest I take aught away from thee!"--And thus they parted from one another, the old man and Zarathustra, laughing like schoolboys.

When Zarathustra was alone, however, he said to his heart: "Could it be possible! This old saint in the forest hath not yet heard of it, that god is dead!"


The website is really bad; the header takes up 30% of the vertical space. What an example of "enterprise standard".


Maybe not the first version, but I can see, several iterations later, this technology making Siri and Alexa actually Wizard-of-Oz levels of smart, where knowledge-based tasks that used to require search engines and multiple queries to refine your results will no longer require that level of user input.


With an approachable interface I can see it replacing a large chunk of search extremely quickly. Let's be honest here: humans cannot even settle on what the "truth" is, so whether or not truthful answers are returned is probably less important to the success of such a service...


And yet this thread is full of examples of people using ChatGPT instead of a search engine.

Phrasing the problem as a binary is wrong. The correct framing is “what is ChatGPT good at today? What are LLMs likely to be good at over the next 5 years?”. It isn’t everything. But it’s a hell of a lot more than nothing.


A bit tangential, but still worth asking since I haven’t seen it addressed anywhere else:

When I type a query into Google, it returns results almost instantaneously.

With ChatGPT, there's latency PLUS it writes out answers annoyingly slowly.

Does anyone know why it spells out every answer like that? Is that a UX gimmick, or is it related to the latency?


Transformers as used in GPT are autoregressive, i.e. they require the previous state: each new token is generated conditioned on everything produced so far, which is why the answer streams out token by token rather than appearing all at once.
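
Roughly speaking, the decoding loop looks like the toy sketch below (next_token here is a made-up placeholder for one full model forward pass, not OpenAI's actual API); the streaming UI is just showing each token as soon as it exists.

    def generate(prompt_tokens, next_token, max_new_tokens=20, eos=None):
        tokens = list(prompt_tokens)
        for _ in range(max_new_tokens):
            tok = next_token(tokens)   # one forward pass per generated token
            if tok == eos:
                break
            tokens.append(tok)
            yield tok                  # streamed out immediately

    # Toy "model" that just echoes the last token, to show the loop runs:
    for t in generate(["hello"], next_token=lambda ts: ts[-1], max_new_tokens=3):
        print(t)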


Maybe we just need to re-think what it means to "Search" or "Learn" with a new kind of tool. Maybe we've been doing it wrong all these years?

To me, Google Search was a minor evolution from yahoo/altavista back in the day. But that paradigm stuck around too long...


I want to add here that I'm building an app at the moment with code solely generated through prompts for ChatGPT. So far, except for a few issues and manual refactoring, it has done very well.

ChatGPT even apologized when it suggested proprietary Firebase, when I told it to use Supabase instead.


All the hoopla was about raising some capital, and we all once again dutifully amplified the pitch.


Can ChatGPT understand negations? Then it would already be an improvement over the existing SOTA of search engines:

https://github.com/elsamuko/Shirt-without-Stripes


Isn’t the main benefit of ChatGPT that it combines multiple resources to weave the answer? Things like handling typos or formulation nuances sound auxiliary to me, probably non-goals at this stage. It’s still less than a month old...


Why does every game have to be zero-sum? Why can't chatgpt live alongside search engines as just another category for results? Like "Images" and "Maps"?

In fact, I look forward to Bing integrating with chatgpt.


If Google Search actually searched (found articles or sites with the keyword), then yes: ChatGPT or its successor would never replace Google Search.

But Google Search is so broken (at least for me) that it looks more like a broken or badly implemented ChatGPT.


> But Google Search is so broken (at least for me)

Every single time I've seen this claim made and the person shared their methodology for searching on Google, it becomes very apparent that the problem is not Google, but the user.


That’s a bit harsh. Early Google didn’t really require any special skills or knowledge on the user’s part. Current day Google requires hacks and workarounds to avoid both SEO spam and Corporate Google’s attempts to sell you something.


I've already offloaded about 1/3 of my searches to ChatGPT, so...


How long would it take for "ChatGPT farms" to crop up and try to influence the results? How much harder will it be to "clean" the results from any learned data set?


I already see videos and tiktoks where people search a question on google and post the aggregated result.

That’s the kind of engagement people go for, and ChatGPT is a seamless next step.


False dichotomy. It’s not one or the other. Combining the two is what is going to be powerful. In fact, this is already happening.


But it can enhance them https://chatgpt4google.com


It will for content that doesn't exist yet.


So I guess by the end of the year we know if we are either in the next AI winter, or in the singularity.


Well, I asked ChatGPT, and it said ChatGPT may replace search engines soon. So there!


I can imagine the Algolia founders had a lot of questions from their investors...


Is "soon" a year or a decade? The author says he's been at his workplace for 7 years. I think ChatGPT 2030 will absolutely destroy Google Search 2023; the question is what Google Search 2030 will be.


Machine translation won't replace human translation any time soon.


I can see that being true for the top 5% (arbitrarily chosen small %) of difficult translations with professional requirements, but I think for the vast majority of day-to-day translation between the major languages (maybe only within the same language family), it is already sufficient and improving.

I only have the anecdata of my friends and myself, but we have yet to encounter a casual conversation type translation that DeepL or Google translate have failed on for Spanish <-> English <-> French. Granted, it doesn't always return the most optimal translation, but it has always given the intended core idea.


Exactly!

ChatGPT might not replace search (entirely), but for many many use cases, it has already replaced search.

When pure machine translation was starting out, there were lots of similar posts about how it can't do YYY, but in a short amount of time it has become very effective.

ChatGPT for me is a much better StackOverflow. In fact, I'd rather people were asking and answering SO-style questions in a format that is easier for machines to understand.

ChatGPT is better at searching for a concept; if it were unencumbered, I could better constrain the concept space it searches over.


Oh, I see. I completely missed the tone/intent of your original post. I'm in complete agreement with you there. Thank you for elaborating.


Does anyone think it will replace or reduce software developers' work?


Maybe, but its articles have been replacing my search results.


I'm using it more than Google these days, but ok...


This will age badly.


Try to Google: Does Jeff Bezos shop at Whole Foods?


Precisely my experience with 95% of stuff I google.

I don’t know if that’s just the nature of the things I usually search for (primarily technical subjects where the query shouldn’t be “dumbed down” and reinterpreted or you get a completely different meaning), but I’ve become incredibly dissatisfied with Google over the past few years. I imagine it’s a fine search engine if you’re looking up the URL for Netflix.


Google has ruined the web.

Websites that showed you the answer in the search result have disappeared.

Recipe websites have gone to crap with filler stories.


Funny how it already has for me.


I'm curious what will happen in future systems where the AI can continually learn and update its model on the fly. I note that ChatGPT can't download a URL from the Internet and summarise it, for example, or apply it as new knowledge. Of course, it seems ripe for exploitation, since there is so much misinformation on the Internet anyway.


Counter point: yes it will


It is already happening.


ChatGPT


- written by a company selling outdated search solutions


More from the same publisher:

"Why touch screen phones won't replace clamshell phones any time soon"

"Why electric light bulbs won't replace gas lighting anytime soon"

"Why transistors won't replace radio tubes anytime soon"

"Why drones won't replace fighter aircraft anytime soon"

"Why electric cars won't replace combustion engines anytime soon"

"Why linux desktop won't replace windows anytime soon"


I think it's worth noting the potential bias, but Algolia's the furthest thing from outdated.


Outdated? How so? The primary thing Algolia is guilty of is being massively overpriced to the point it’s a non-starter for any platform with even moderate usage. But their full-text search functions quite well and is very fast.


Vector search over any corpus that fits in memory is quite fast. Hell, a naive inverted index performs quite well for 90% of tasks.
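
For anyone who hasn't built one: a naive inverted index really is just a term -> documents map plus a set intersection at query time. A minimal sketch (toy data, obviously, not Algolia's implementation):

    from collections import defaultdict

    docs = {
        1: "chatgpt will not replace search engines",
        2: "search engines rank and crawl the web",
        3: "chatgpt answers questions conversationally",
    }

    # Build the index: term -> set of document ids containing it
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.split():
            index[term].add(doc_id)

    def search(query):
        terms = query.split()
        if not terms:
            return set()
        result = set(index.get(terms[0], set()))
        for term in terms[1:]:
            result &= index.get(term, set())
        return result

    print(search("chatgpt search"))   # {1}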


If google or chatgpt were behind paywalls, which would you pay for?


This reads like an onion headline: “search engine won’t become obsolete, says search engine maker”


I think this was framed around using Google to answer a question vs. using ChatGPT.

For example, asking "How can I tie my shoes?" on Google requires that I click on a link (or several) and hope for the best. Meanwhile, ChatGPT is likely to give me a detailed walkthrough and answer any follow-up questions.

Another good example someone mentioned was "How did WW2 start?". Google gives you a path to find the answer, but ChatGPT will outright give you an answer.

I'd say ChatGPT's competition is really Google Assistant and Siri rather than Google Search. Personally, if I'm using "Okay Google", I don't want to be referred to links, as that requires me to pick up my phone.


Oh wow, thanks!


reminds me of when i said 'my girlfriend won't replace my wife' ... whoops!


I don’t know about the author, but for me personally ChatGPT has already replaced 70% of what I used to go to Google for. More importantly, it saves me a lot of time. That is just incredible.


ChatGPT isn't search! For gods' sakes! It's not even related to search, ChatGPT threatening search is a meme put around by VC tech bros. Which is ironic - because ChatGPT is a tech bro - it knows about everything, talks authoritatively, and is mostly wrong.

The only thing to worry about in this entire conversation is how search is broken. Search is so broken that we're willing to just replace it with some tech bro. That's what we have with ChatGPT. Oh, don't Google it, just ask some tech bro instead. What's the likelihood the tech bro actually knows the answer? Pretty low... but he'll sound very convincing.

Search is broken, and if ChatGPT is the answer, I despair.



