I was going over some of the code in the core folder for concurrency, threading, and compression, and what surprised me is that there are absolutely no comments whatsoever. I agree that unless there's excellent documentation, open-source maintenance might be challenging.
Having said that, this definitely does look to be an impressive feat of engineering!
This looks very impressive!
As another commenter echoed, the code base is ~5 million lines of C++, but with almost no comments at all. Unless the documentation is excellent, maintenance and open-source work are going to be difficult.
P.S. I wonder if LLMs could be used to generate docs and comments for big hairy codebases. It seems the current generation of LLMs lacks the context to do it, but maybe it's "just one or two more papers down the line"®...
While transforming the plan into vectors is interesting, I wish they'd gone into more detail about how the ML model prunes and filters candidate plans to pick the best one. It's also not clear which attributes of a plan the corresponding vector encodes.
I don't know much about Databloom, but it looks like this "Learning-Based Query Optimizer" is built for specific use cases in a data engineering/analytics setting (like the K-means example cited in the article). It might not be a replacement for the optimizers in traditional databases.
> not clear what attributes of a plan the corresponding vector encodes
Fig 5, page 4 from [1]:
> Topology Features
> Operator Features
> Data Movement Features
> Dataset Features
That's for a single logical plan, meaning the feature vector will vary in length for another query. (Which is the part I don't get: do you learn a new model per query? Can you train with a variable feature length?)
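One common workaround for variable-length plan features (this is a guess on my part, not necessarily what the paper does) is to encode each operator as a small fixed-size vector and then pool element-wise over all operators, so plans with different operator counts still map to vectors of the same length. A minimal sketch, with a made-up operator vocabulary and cardinality feature:

```python
# Hypothetical sketch: fixed-length encoding of a variable-size plan.
# Each operator gets a fixed-size feature vector (one-hot type + estimated
# rows); the plan-level vector concatenates sum- and max-pools over all
# operators, so its length is independent of the number of operators.

OPERATOR_TYPES = ["scan", "filter", "join", "aggregate"]  # assumed vocabulary

def operator_features(op_type: str, est_rows: float) -> list[float]:
    """One-hot operator type plus one numeric feature (estimated cardinality)."""
    one_hot = [1.0 if t == op_type else 0.0 for t in OPERATOR_TYPES]
    return one_hot + [est_rows]

def encode_plan(operators: list[tuple[str, float]]) -> list[float]:
    """Pool per-operator vectors (sum and max) into one fixed-length vector."""
    vecs = [operator_features(t, r) for t, r in operators]
    dim = len(vecs[0])
    pooled_sum = [sum(v[i] for v in vecs) for i in range(dim)]
    pooled_max = [max(v[i] for v in vecs) for i in range(dim)]
    return pooled_sum + pooled_max  # length fixed regardless of plan size

short_plan = [("scan", 1e6), ("filter", 2e5)]
long_plan = [("scan", 1e6), ("scan", 5e5), ("join", 3e5), ("aggregate", 1e3)]
assert len(encode_plan(short_plan)) == len(encode_plan(long_plan))
```

With something like this you'd train one model across queries; whether the paper does pooling, padding, or a per-query model is exactly what isn't clear to me.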