One side effect of these tools that I don't think I like is the inevitable push for developers to just use the most popular languages. JavaScript and Python are the main languages LLMs are trained on, so it self-reinforces the idea that those are best for everything. There is something a bit sad about that. I guess the dream is that they're just intermediary data anyway, like a chatty bytecode layer.
I tried this and couldn't get it to work on Rails 7.1 / Ruby 3.2: no output. The README.md could do with some examples of what sort of recommendations come out.
Edit: it outputs metrics, just no recommendations, so moving on.
Here in Canada, there was a trend for the 'Couch Potato Portfolio' with the arrival of popular ETFs: a fixed split between equities and bonds, rebalanced once a year. The key point was really using whole-index ETFs with low MER fees on autopilot, with the bond portion providing the stability. 'All-in-one' ETFs have sort of replaced that.
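The once-a-year rebalance is simple to express in code. A minimal Ruby sketch, assuming a 60/40 target; the holdings figures are made up for illustration:

```ruby
# Annual rebalance back to a fixed equity/bond split (60/40 here).
# Returns the buy (+) or sell (-) amount per asset to hit the targets.
def rebalance(holdings, targets)
  total = holdings.values.sum.to_f
  targets.to_h { |asset, weight| [asset, (total * weight) - holdings[asset]] }
end

holdings = { equities: 70_000, bonds: 30_000 }  # drifted after a good year
targets  = { equities: 0.6,    bonds: 0.4 }

trades = rebalance(holdings, targets)
# trades[:equities] => -10000.0 (sell), trades[:bonds] => 10000.0 (buy)
```

The same arithmetic is why the strategy only needs attention once a year: you sell whatever ran ahead of its weight and buy whatever fell behind.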
My first 'job' as a kid was working at Oasis Software on White Lightning, which was a Forth based game making kit for the ZX Spectrum (amongst other platforms). About a million years ago now. Or now ago million, if you know Forth. https://worldofspectrum.org/archive/software/utilities/white...
White Lightning was really a sprite editor packaged with an interrupt-driven DSL for games, back before 'DSL' was a term. Forth was a good choice of compilation target on the platforms of the day, given the constraints. Ironically, I do Ruby now.
I've picked up Forth to code for the Agon Light, which is a new spin on those old platforms - it's BBC Micro-patterned, but uses some modern chips to make things faster and roomier. Easy to develop for on an emulator too. One of the complaints I've read about Forth on micros was that even an 8 KB interpreter overhead would be too much for a commercial game - what I've gathered is that many people used it primarily as a macroassembler in that context. But I have 512 KB and 20 MHz to play with, plus external display processing, so I can probably ignore that overhead and do just fine.
I find Forth quite a bit easier to work with when I treat it as an interactive stack machine and do the bulk of development through offline code generation. That gives me a best-of-both where I can do useful things interactively, but I can add compiler tech to help rein in and structure whatever I'm doing.
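The "offline code generation" half of that workflow can be as simple as a host-language script that emits Forth source to paste into (or upload to) the interactive session. A toy Ruby sketch; the word name and sprite bytes are invented for illustration:

```ruby
# Toy generator: turn a byte table into a Forth `create`/`c,` definition
# that can be fed to the interactive Forth session. The name and data
# here are invented; real sprites would come from an editor's output.
def forth_table(name, bytes)
  cells = bytes.map { |b| format("$%02X c,", b) }.join(" ")
  "create #{name} #{cells}"
end

puts forth_table("player-sprite", [0x18, 0x3C, 0x7E, 0xFF])
# => create player-sprite $18 c, $3C c, $7E c, $FF c,
```

The interactive side stays small (peek, poke, test a word), while the host side generates the bulk data and boilerplate definitions.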
I frickin' LOVE the Agon Light. To me it's the most exciting and promising of the "retro-inspired new computer" designs.
I've messed about with Forth on the Commodore 64 (via THEC64). It's fun but I haven't dug too deeply into it, like building a game with it or anything.
I play around with White Lightning occasionally because of the ease of making games with it on the C64 and the comprehensive documentation (PDFs of it have been preserved).
Yep, the tape-to-tape copying days were in full swing around 1984, so the manual, on either red or green paper, was the only way to keep it selling. The sprite library package and Forth's closeness to machine code were its strengths. Not easy to work with, though.
Some smaller ones, but nothing big. There was a user-submission contest that got some traction, but the jump up from BASIC to this was not easy. This was the time of 'rock star game devs', but interrupts, memory paging and sprite libraries probably crushed a few teenage dreams. An old video of what it could do: https://www.youtube.com/watch?v=IgmZ2Ht-QiQ
From the manual (those were different times indeed!):
> We are also interested to hear of any extensions or routines you may develop, and if sufficient interest is shown we will start a News Letter, and possibly, even a User Group.
This is what my local LLaMA 13B Q5 had to say on this:
> There are two ways to approach this problem. First, we can count each person by gender and then add them up. Second, we can use the given information about the siblings and their genders to find a quicker solution.
> Let's go with the second method:
> 1. Sally has one sister (herself).
> 2. Each brother has two sisters. So, there are 3 sisters for each of her brothers.
> 3. Since there are three brothers, there are a total of 3 x 3 = 9 sisters for them all.
> 4. Now that we know Sally has one sister and the rest belong to her brothers, we can subtract her share from the total: 9 - 1 = 8 sisters.
I am running the 7B and it gave:
Q: "Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?"
A: "If Sally has 3 brothers, and each brother has 2 sisters, then the total number of sisters is:
3 brothers x 2 sisters per brother = 6 sisters
So, Sally has 6 sisters in total."
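Both models miss the trick: each brother's "2 sisters" counts every girl in the family, so there are only 2 girls, and Sally has 1 sister. The arithmetic is trivial once stated; a quick Ruby check:

```ruby
# The brothers' "2 sisters" are all the girls in the family,
# so total girls = 2; Sally's sister count excludes Sally herself.
sisters_per_brother = 2
total_girls = sisters_per_brother
sallys_sisters = total_girls - 1
puts sallys_sisters  # => 1
```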
The official deprecation was made in Dec 2023. Removing it in the next feature update is too soon. Lots of people still have working WMR units, so this feels super cheap.
I think the plan was for the Southwest to be given takeoff clearance quickly before the private jet arrived, but the controller got distracted by a route amendment that came in for another flight and got their priorities wrong. In the US it's not uncommon for 'cleared to land' to be given while another aircraft is lining up and waiting for immediate takeoff clearance.
In the EU / Canada this isn't allowed (landing clearance is only given once the runway is completely clear), but the US pushes a lot more aircraft through its busy airspace this way. San Diego is a busy airport with just a single runway (due to the geography) near a couple of military bases, and has a steep approach over the city; if there had been ground fog, as there often is at KSAN, this could have ended very differently.
Also, you can see in the video that, due to the steep approach, the SWA was at the beginning of the runway, while the jet on approach would have touched down further along, as the landing threshold is offset there. It would still have been dangerous, but not a certain collision.
Yeah. The incoming jet was on a 5-mile final. Being a small jet it's "slow", so that 5-mile final is 3-5 minutes. You can see at 19:04Z the Southwest jet gets the "line up and wait". 19:07Z is the missed approach. If ATC had given them the departure clearance relatively quickly, there would have been no issue, as Southwest would have hit the gas and been long gone... but ATC didn't.
It's not uncommon for aircraft to get on the runway and take off while another aircraft is on final. If you want a good video of how this more typically works (and sometimes doesn't), here's one from an incident at SFO about 2 months ago: https://www.youtube.com/watch?v=DrRGde5J8mo
In both the SFO cases, they got the "go around" from ATC much earlier and no one got too close to each other. Keep in mind that SFO is extremely busy and lands on two parallel runways at the same time: https://www.youtube.com/watch?v=YsKhZm78DuY
They don't land planes at KSAN in fog, because of the steep descent. So it would have ended very differently: that Citation would have been diverted elsewhere.
The real error here was that SWA should have been told to hold short and then expedite (as the SWA boys like to do) once the Citation was clear, or to expedite from the get-go. Either would have worked just fine.
Also worth mentioning that while you can give landing clearance for a runway that is not clear at the time, you still cannot have multiple aircraft on a given runway simultaneously, and there are very strict minimum separation requirements that vary by airport; for KSAN it's 5 NM.
This is an interesting article, and a bit of a mishmash of UI conventions, application ideas for GPT, and actual patterns for LLMs. I really do miss Martin Fowler's own take on these things, but using his name as some sort of gestalt brain for Thoughtworks works too.
It still feels like a bit of a Wild West for patterns in this area, with a lot of people trying lots of things, and it might be too soon to be defining terms. A useful resource is still something like the OpenAI Cookbook, which is a decent collection of a lot of the things in this article, but with a more implementation-focused bent.[1]
The area that seems to get a lot of idea duplication currently is providing either a 'session' or a longer-term context for GPT, be it with embeddings or rolling prompts for these apps. Vector search over embedded chunks is something that seems to be missing so far from vendors like OpenAI, and you can't help but wonder whether they'll eventually move it behind their API with a 'session id'. I think that was mentioned as being on their roadmap for this year too. The lack of GPT-4 fine-tuning options just pushes people more towards the Pinecone, Weaviate, etc. stores and chaining up their own sequences to achieve some sort of memory.
I've implemented features with GPT-4 and functions, and so far it feels useful for 'data model' use (where you're bringing JSON about a domain noun, e.g. 'Tasks', into the prompt) but pretty hairy when it comes to pure functions: the tuning they've done to get it to pick which function and which parameters to use is still hard to get right, which means there isn't a lot of trust that it's going to be usable. It feels like there needs to be a set of patterns or categories for 'business apps' that are heavily siloed into just a subset of available functions, making them task-specific rather than the general chat agents we see a lot of. The difference in approach between LangChain's chain-of-thought pattern and just using OpenAI functions is sort of up in the air as well. Like I said, it still feels like Wild West times, at least as an app developer.
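By "siloed into just a subset of available functions" I mean something like exposing a single narrow function per task rather than a whole toolbox. A sketch of the request payload shape, following OpenAI's chat-completions `functions` field; the function name and parameters here are hypothetical:

```ruby
require "json"

# Hypothetical single-task function set for a 'Tasks' domain noun.
# Constraining the model to one narrow function (with enum-limited
# fields) is the "silo" idea, versus a general-purpose tool list.
functions = [
  {
    name: "update_task",
    description: "Update a single field on an existing Task",
    parameters: {
      type: "object",
      properties: {
        task_id: { type: "integer" },
        field:   { type: "string", enum: %w[title due_date status] },
        value:   { type: "string" },
      },
      required: %w[task_id field value],
    },
  },
]

payload = {
  model: "gpt-4-0613",
  messages: [{ role: "user", content: "Push task 7's due date to Friday" }],
  functions: functions,
  function_call: "auto",  # or force it: { name: "update_task" }
}
puts JSON.pretty_generate(payload)
```

Forcing `function_call` to a named function, rather than `"auto"`, is one way to take the "which function?" decision away from the model entirely.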
Agreed, that is a good resource for sure. For tooling I like https://promptmetheus.com/ but any pun name gets bonus points from me.
> For in-context learning, I think it is fair to expect 100k to 500k context windows sooner. OpenAI is already at 32k.
It has been interesting to see that window increase so quickly. For LLM context the biggest constraint is pay-per-token if you don't run your own model, so you have to wonder whether that pricing will survive given how this is trending. In terms of idempotent calls, throwing the whole context up every time makes it likely that OpenAI will encroach on the stores side as well and do sessions.
I've set this up in a staging environment of one of our apps to take a look. The staging environment we use for one-offs is on Heroku (we can stand one up and down quickly), and the first issue is that a lot of the 'easy deployment is a feature' PaaS platforms bill by web/worker size restricted by memory rather than pure virtual CPU power. Render etc. all do this as well, and the memory headroom is low.
This means you often have to deal with, say, 512 MB per instance, and then, if using something like Puma, work out how to split concurrency vs memory footprint. What I'm finding is that 3.2's YJIT loves memory, so I have to trade that off, which means less concurrency per process. Benchmarking quickly, the 15% per-thread gains aren't worth dropping to just 2 threads on a 512 MB instance versus the 3 threads I get with YJIT disabled.
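Concretely, the knob-turning looks something like this `config/puma.rb` fragment; the numbers are illustrative for a 512 MB instance, not a recommendation:

```ruby
# config/puma.rb sketch for a 512 MB instance; numbers are illustrative.
# YJIT's extra per-process memory pushes the thread count down.
yjit = ENV["RUBY_YJIT_ENABLE"] == "1"   # Ruby 3.2 opt-in env var

workers 1                        # one process fits the 512 MB headroom
threads_count = yjit ? 2 : 3     # trade concurrency for YJIT's footprint
threads threads_count, threads_count
preload_app!                     # share memory across forks via copy-on-write
```

Whether the per-thread speedup beats the lost thread is exactly the benchmark that has to be run per app.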
I think it's really neat and will keep tracking it, but the performance-for-memory trade-off isn't quite there yet for our app profile. Not sure if others will find the same; I guess it depends on whether their production environment is a PaaS with low memory headroom or not.
OT(ish), but a t3.medium on EC2 with 2 vCPUs and ~4 GB of memory is $33 a month on-demand, $20/mo reserved, and $10/mo spot.
I'm mostly done with Heroku. With one kinda big app left, all other environments and projects are now just on AWS without the Heroku tax.
And I was a big Heroku fan, but their recent decisions made me shop around. I do miss the dollar price of metal in a DC, but not the time price of metal.