diwank's comments | Hacker News

While I agree that AI infra startups are hard to build, I strongly disagree with the idea that they are harder than foundational or application layer startups. I think it boils down to what you know and what resources you can muster.

For instance, foundational AI startups are also ridiculously hard to build. You need an insane amount of funding, you spend it pretraining models just to stay competitive, only to find that gains in hardware and model architecture make them obsolete within months, and there's no real guarantee that scaling will keep working.

Application layer startups are hard in a very different way: there's an insane amount of competition, and new capabilities emerge every few weeks. I have worked with a few AI girlfriend startups, and they are really struggling to keep pace and ward off a ridiculous amount of competition.

I think it's really just YMMV. Of course, the deeper you get into the stack, the more monopolizing pressure there is. Is it hard to build AI infra startups? Yes, 100%. Will there be very few winners? Yes. Is it harder than foundational or application layer startups? Depends on the founders' strengths. Is it a lost cause? I really don't think so.


Author here. Yes, I explicitly called out the danger of thinking application layer startups are easier, because it totally depends on the founding teams' backgrounds and interests.

Just to add: not easily defeated. I am even hosting a bounty for whoever can break this one (the model weights and tokenizer are available on Hugging Face) :D


Yes, of course, but this was just to validate the hypothesis that FMs and their tokenizers can generalize on a fully encrypted dataset.

Now that the approach is clearly viable, I am going to extend it to use a modern cipher like XChaCha20, which is a gold standard.
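
For what it's worth, here is a minimal sketch of what such an extension might look like (my own assumption, not the author's actual scheme): XChaCha20-Poly1305 via PyNaCl's libsodium bindings, with the 24-byte nonce derived deterministically from the plaintext so identical strings encrypt to identical ciphertexts, which a model would need if it is to find structure in the encrypted corpus.

```python
# Minimal sketch of encrypting a text dataset with XChaCha20-Poly1305 via PyNaCl
# (libsodium bindings). Assumption: the nonce is derived deterministically from
# the plaintext with a keyed BLAKE2b hash, so repeated strings produce repeated
# ciphertexts. This is an illustration only, not the author's actual scheme.
import base64
import hashlib

from nacl import bindings

KEY = bytes(32)  # demo only -- use a securely generated 32-byte key in practice


def encrypt_record(text: str, key: bytes = KEY) -> str:
    plaintext = text.encode("utf-8")
    # 24-byte XChaCha20 nonce, derived from the plaintext (deterministic, SIV-style).
    nonce = hashlib.blake2b(plaintext, key=key, digest_size=24).digest()
    ciphertext = bindings.crypto_aead_xchacha20poly1305_ietf_encrypt(
        plaintext, None, nonce, key
    )
    # Prepend the nonce so each record is self-contained and decryptable later.
    return base64.b64encode(nonce + ciphertext).decode("ascii")


def decrypt_record(token: str, key: bytes = KEY) -> str:
    blob = base64.b64decode(token)
    nonce, ciphertext = blob[:24], blob[24:]
    plaintext = bindings.crypto_aead_xchacha20poly1305_ietf_decrypt(
        ciphertext, None, nonce, key
    )
    return plaintext.decode("utf-8")
```

decrypt_record with the same key restores the original text; without the key, both the training records and anything derived from them stay opaque.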


Also to clarify: privacy-preserving here means protecting against the model inference provider. Sort of like, if OpenAI trained a GPT-4 using this scheme and gave the government the key, then the government could use it safely even while it's hosted on OpenAI's servers, and OpenAI, on the other hand, would not need to share the model weights with the government.


Thanks for the clarification, this makes a lot more sense now!


It is possible to reconstruct the text from an embedding, but more importantly, I believe that by "embedding" the OP means sending the computed embedding matrix instead of the input, which is a simple matrix inversion problem.

Computing output embeddings is different.


Yep and embeddings can be decoded back into meaningful text if you have the model weights and the tokenizer.
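
To make that concrete, here is a minimal sketch (my own illustration, not anything from the thread), assuming the vectors being sent are the model's per-token input embeddings and the tokenizer exposes a Hugging Face-style decode(): a nearest-neighbor lookup against the embedding table recovers the tokens.

```python
# Minimal sketch: recovering tokens from input embeddings by nearest-neighbor
# lookup against the model's embedding matrix. Assumes `sent_vectors` are the
# per-token input embeddings (n x d), `embedding_matrix` is the model's token
# embedding table (vocab x d), and `tokenizer` has a Hugging Face-style decode().
import numpy as np


def decode_input_embeddings(sent_vectors, embedding_matrix, tokenizer):
    # Normalize rows so the dot product equals cosine similarity.
    table = embedding_matrix / np.linalg.norm(embedding_matrix, axis=1, keepdims=True)
    vecs = sent_vectors / np.linalg.norm(sent_vectors, axis=1, keepdims=True)
    token_ids = (vecs @ table.T).argmax(axis=1)  # best-matching vocabulary row per vector
    return tokenizer.decode(token_ids.tolist())  # map ids back to readable text
```

In other words, sending input embeddings instead of text offers little protection against anyone who holds the weights and tokenizer.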


It can only be decrypted using the encryption key. The inputs are encrypted and look like gibberish, and so are the outputs.

It preserves privacy by essentially making the inputs and outputs unreadable.


The Pagani Utopia is fairly new (launched September 2022) and is completely bereft of screens and gizmos. A complete work of art, costing a mere $2.5 million.


And it’s not an EV.


I would love to do research in Foundation Models and Philosophy of Mind.


We are using both Ellipsis and Sweep for our open source project, and they are quite helpful in their own ways. I think selling them as an automated engineer is a little over the top at the moment, but once you get the hang of it they can spot common problems in PRs or do small documentation-related tasks quite accurately.

Take a look at this PR for example: https://github.com/julep-ai/julep/pull/311

Ellipsis caught a bunch of things that would have come up only in code review later. It also got a few things wrong, but those are easy to ignore. I like it overall; it's helpful once you get the hang of it, although far from a “junior dev”.


> Take a look at this PR for example: https://github.com/julep-ai/julep/pull/311

I am still confused if vector size should be 1024 or 728 lol.


Lolll. It’s 1024 but only for documents and not the tools (we changed the embedding model for RAG)


Why isn’t the AI suggesting putting it into an appropriately named const? Magic numbers are poor practice.


Good catch. The team could add this rule to their Ellipsis config file to make sure that it's always flagged: "Never use magic numbers. Always store the number in a variable and use the variable instead."

Docs: https://docs.ellipsis.dev/config#add-custom-rules


But even that isn’t ALWAYS the case. There are times when it is appropriate to have numbers inline, as long as they’re not repeated.

This is where good judgement comes in, which is difficult to encode rules for.
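
A tiny illustration of that judgment call (hypothetical names, Python just for concreteness): name the number when it carries domain meaning or is reused, and leave it inline when the expression itself explains it.

```python
# EMBEDDING_DIM carries domain meaning and is used in several places: name it.
EMBEDDING_DIM = 1024


def make_query_vector(values: list[float]) -> list[float]:
    if len(values) != EMBEDDING_DIM:
        raise ValueError(f"expected {EMBEDDING_DIM} dimensions, got {len(values)}")
    return values


def midpoint(a: float, b: float) -> float:
    # An inline 2 is fine here: it is not "magic", it is the definition of a midpoint.
    return (a + b) / 2
```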


> I think selling them as an automated engineer is a little over the top at the moment

Indeed. Amazon originally advertised CodeGuru as being "like having a distinguished engineer on call, 24x7".[^1] That became a punchline at work for a good while.

I can definitely see the value of a tool that helps identify issues and suggest fixes for stuff beyond your typical linter, though. In theory, getting that stuff out of the way could make for more meaningful human reviews. (Just don't overpromise what it can reasonably do.)

[^1]: https://web.archive.org/web/20191203185853/https://aws.amazo...


As it stands today, Ellipsis isn't sold as an AI software engineer.

One of our biggest learnings is that state-of-the-art LLMs aren't good enough to write code autonomously, but they are good enough to be helpful during code review.


Right, I stand corrected; I think I confused it with the branding of other competing products. I remember really liking the fact that Ellipsis does _not_ sell itself as a developer. I’ll edit my comment to reflect that. :)


I’ve been following Sweep and aider for a while and really love what they’re both doing, especially Sweep.

Would love to get your thoughts on sweep. Does it meet your expectations? If not, where does it fall short?


Not as the “junior dev” that Sweep markets itself as, but it is useful in its own ways. For example, one really nifty way I found to use it effectively is to:

- git diff
- gh issue create “sweep: update docs for this file change” for every file changed

It’s not perfect even after that, but it gives me a good starting point and often just needs a minor change.
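
A rough automation of that workflow (my own sketch, not something from the thread; it assumes the GitHub CLI `gh` is installed and authenticated for the repo):

```python
# Rough automation of the workflow above: open one "sweep:" issue per changed file.
# Assumes the GitHub CLI (`gh`) is installed and authenticated for the current repo.
import subprocess


def changed_files(base: str = "HEAD") -> list[str]:
    # List files touched relative to `base` (uncommitted changes by default).
    out = subprocess.run(
        ["git", "diff", "--name-only", base],
        check=True, capture_output=True, text=True,
    )
    return [line for line in out.stdout.splitlines() if line.strip()]


def file_sweep_issues(base: str = "HEAD") -> None:
    for path in changed_files(base):
        subprocess.run(
            [
                "gh", "issue", "create",
                "--title", f"sweep: update docs for changes in {path}",
                "--body", f"`{path}` changed; please update the related documentation.",
            ],
            check=True,
        )


if __name__ == "__main__":
    file_sweep_issues()
```

Run after making changes, it opens one issue per touched file, which Sweep can then pick up.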


Any thoughts on aider vs. sweep so far? I am also interested in trying out both...

