On a similar note, I really hope that the AI companies that don't make it, but have invested a lot in curating and annotating high-quality datasets, will release them to the public. Autonomous car and robotics companies in particular, since that kind of data doesn't exist on the internet as abundantly as, say, natural language text.
If you want to gain familiarity with the kind of terminology you mentioned here, but don't have a background in graduate-level mathematics (or even undergrad really), I highly recommend Andrew Ng's "Deep Learning Specialization" course on Coursera. It was made a few years ago but all of the fundamental concepts are still relevant today.
Fei-Fei Li and Andrej Karpathy's Stanford CS231N course is also a great intro to the basics of the math from an engineering-forward perspective. I'm pretty sure all the materials are online. You build up from the basic components to an image-focused CNN.
Interesting! I would like to learn more about how AI is being applied to robotics. Do you have any suggestions for how to keep up with developments/ideas in this field?
...and plan to do an updated version soon covering much of what's been released since. I've also done work related to LLM and robotics integration, which is also on that site.
Working my way through your blog post, and it is so refreshing. Unfortunately, my feed's algorithm currently shows me takes that are extreme on either end (like the ones you describe in your blog post).
> Technology’s largest leaps occur when new tools are provided to those that want to make things.
I love this sentence, and the general attitude of curiosity in your post.
Thanks! Appreciate the kind words. I should have a follow-up out in the next month or so (interviewing and finishing my Master's, so there have been delays) that covers more recent advances in router-style VLAs, sensorimotor VLMs, and embedding-enriched vision models in general.
If you want a great overview of what a modern robotics stack can look like with all this, https://ok-robot.github.io/ was really good and will likely make it into the article. It's a VLA combined with existing RL methods to demonstrate multi-tasking robots, and serves as a great glimpse into what a lot of researchers are working on. You won't see these techniques in industrial or commercial settings yet - we're still too new at this to be reliable or capable enough to deploy them on real tasks.
Thanks! So this is something I tried, and qualitatively I didn't see a huge difference. I'd like to swap out my hand-rolled modules for standard PyTorch modules for self-attention etc. and train it on the English Wikipedia split. That's on my to-do list for sure.
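Roughly, the swap would look something like this (a minimal sketch; all module names and dimensions here are illustrative, not my actual code):

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """Transformer block using torch's built-in nn.MultiheadAttention
    in place of a hand-rolled implementation (dims/names illustrative)."""
    def __init__(self, d_model=512, n_heads=8):
        super().__init__()
        # nn.MultiheadAttention handles the QKV projections and softmax
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln1 = nn.LayerNorm(d_model)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )

    def forward(self, x):  # x: (batch, seq, d_model)
        # causal mask so each token attends only to earlier positions
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1)).to(x.device)
        h = self.ln1(x)
        a, _ = self.attn(h, h, h, attn_mask=mask)
        x = x + a
        return x + self.mlp(self.ln2(x))
```

(If memory serves, the English Wikipedia split is on the Hugging Face hub as something like `load_dataset("wikipedia", "20220301.en")`.)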
I ran some tests. A single model of the same size is better than an MoE. A single expert taken out of the N is better than a standalone model of the same size (i.e., the same size as one expert). Two experts are better than one. That was on a small LLM; not sure if it scales.
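For concreteness, by MoE I mean the usual top-k routed FFN, something like this toy sketch (illustrative only; a real setup would also need a load-balancing loss):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoE(nn.Module):
    """Toy mixture-of-experts FFN with top-k gating (sketch only)."""
    def __init__(self, d_model=256, d_hidden=1024, n_experts=4, k=2):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (tokens, d_model)
        # route each token to its top-k experts, softmax over those logits
        weights, idx = self.gate(x).topk(self.k, dim=-1)
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                sel = idx[:, slot] == e  # tokens whose slot-th choice is e
                if sel.any():
                    out[sel] += weights[sel, slot, None] * expert(x[sel])
        return out

# e.g. "2 experts out of 4": y = MoE(n_experts=4, k=2)(torch.randn(32, 256))
```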
Then perhaps a method emerges from this to make training faster (though not inference): do early training on highly quantized (even ternary) weights, then swap the weights out for fp16 or something and fine-tune? Might save $$$ when training large models.
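A rough sketch of how that could be prototyped in PyTorch, using a straight-through estimator so the early phase trains against ternarized weights (the class name and threshold here are made up, not any established recipe):

```python
import torch
import torch.nn as nn

class TernaryLinear(nn.Linear):
    """Linear layer whose weights are ternarized to {-1, 0, +1} * scale on
    the forward pass; a straight-through estimator lets gradients flow to
    the underlying full-precision weights. Threshold is ad hoc."""
    def forward(self, x):
        w = self.weight
        scale = w.abs().mean()
        # zero out small weights, keep the sign of the rest
        q = torch.where(w.abs() < 0.5 * scale,
                        torch.zeros_like(w), w.sign()) * scale
        # straight-through: forward uses q, backward sees identity
        w_q = w + (q - w).detach()
        return nn.functional.linear(x, w_q, self.bias)

# hypothetical recipe: cheap early training with ternarized weights...
model = nn.Sequential(TernaryLinear(512, 512), nn.ReLU(),
                      TernaryLinear(512, 512))
# ...then drop the quantization and fine-tune the learned full-precision
# weights in fp16 (in practice you'd want mixed precision / loss scaling)
finetune = nn.Sequential(nn.Linear(512, 512), nn.ReLU(),
                         nn.Linear(512, 512))
finetune.load_state_dict(model.state_dict())
finetune.half()
```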
Because it shifts the burden (or at least the appearance) of responsibility from those experiencing homelessness to the government orgs tasked with housing them.
Uh... how does "unhoused" do that? Put differently, I don't see how unhoused is synonymous with "the government has not provided these people with a house". The opposite of unhoused would be housed. Is everyone who is housed in that position because the government provided a house for them?
Wrongly so, I'd argue. It's your own responsibility to secure a place for yourself (to live, and in society generally). Failure to do this is personal, not collective.