Hacker News new | past | comments | ask | show | jobs | submit | karsinkk's comments login

Oracle 23ai also has a similar feature that "explodes" JSON into relational tables/columns for storage while still providing JSON based access API's : https://www.oracle.com/database/json-relational-duality/


I was going over some of the code in the core folder for concurrency, threading and compression, what surprised me is that there’s absolutely no comments whatsoever. Agree that unless there’s excellent documentation, open source maintenance might be challenging.

Having said that, this definitely does look to be an impressive feat of engineering!


This looks very impressive! As another commenter echoed, the code base is ~5million lines of C++ code, but almost no comments at all. Unless the documentation is excellent, maintenance/open source work is going to be difficult.


The docs, for the reference: https://ytsaurus.tech/docs/en/

P.S. I wonder if LLMs could be used to generate docs and comments for big hairy codebases. Seems that the current generation of LLMs lack context to do it, but maybe it's "just one or two more papers down the line"®...


The cost of wrong docs is pretty high. You’d need someone knowledgeable to make corrections


Does anyone know of a way to check the amount of lending done through Bank Term Funding Programme so far?


Lazy Adaptive Trees : https://dl.acm.org/doi/10.14778/1687627.1687669

It is a B-Tree where updates to the tree are buffered in the branch nodes and are propagated down to the leaf nodes at a later point in time.



While transforming the plan into vectors is interesting, I wish they'd gone into more detail about how the ML model prunes and filters the best plan. It is also not clear what attributes of a plan the corresponding vector encodes. I do not know much about Databloom, but it looks like this "Learning-Based Query Optimizer" is built for specific use-cases in a Data engineering/analytics setting(like K-means as cited in the article). It might not be a replacement for optimizers in traditional Databases.


> not clear what attributes of a plan the corresponding vector encodes

Fig 5, page 4 from [1]:

> Topology Features

> Operator Features

> Data Movement Features

> Dataset Features

For a single logical plan, meaning it will vary in its length for another query. (which is a part I don't get: you learn a new model per query? Can you learn with a variable feature length?)

[1] https://conferences.computer.org/icde/2020/pdfs/ICDE2020-5ac...


Is there any reading material that you can share about the M1 scheduler?


This article just appears to quote most of the content from the original[1]. It'd be better if the original was in the front page instead.

[1] https://mymodernmet.com/ancient-egyptians-attendance-record/


The following article is one of my favorite primers on Character sets/Unicode : https://www.joelonsoftware.com/2003/10/08/the-absolute-minim...


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: