Hacker News new | past | comments | ask | show | jobs | submit | morgango's comments login

Please don't listen to all of these folks who are trying to bring conflict into your life. You did something interesting, and learned a lesson about how big institutions actually work. It is a great story, and one that you can tell your friends.

The easiest path forward is to do what it takes to graduate, it sounds like you are one quarter away. Smile, play nice, help out where you can. Get everything in writing.

Definitely TALK to a lawyer and have that in your back pocket. It is likely there is some sort of legal aid through the law school and you can. However, only use this as a last resort. It would be no problem for a university to drag something like this out for months or years and you will be left without a degree.


That's reasonable advice if he could cease to work on his project and status quo were resumed, but from his telling of events, it sounds like they're saying he can't study at all unless he works for them for free. Such a scenario would be extortion, and is worth taking the stand on IMO.

There is already conflict in his life at this point. The question is how best to resolve it. The school is in a position of authority and is telling the student that after spending tens of thousands of dollars at his school, he can't register for his final quarter necessary to graduate, unless he provides additional free work for them. This absolutely should not be tolerated.

Everyone in this thread is simply suggesting he talk to a lawyer. The lawyer can help guide him on the next action to take.


Like the Roma?



But as a comparison of scale, both the airports in Denver and Dallas-Fort Worth are larger than Manhattan (27 mi2 or 70 km2).

https://en.wikipedia.org/wiki/Dallas_Fort_Worth_Internationa....

While that is a ton of activity (and empty space, if you have ever seen those airports), the Big Apple might not be the best reference for scale.


Great point!

(Disclaimer: I work for Elastic)

Elasticsearch has recently added a data type called semantic_text, which automatically chunks text, calculates embeddings, and stores the chunks with sensible defaults.

Queries are similarly simplified, where vectors are calculated and compared internally, which makes a lot less I/O and a lot simpler client code.

https://www.elastic.co/search-labs/blog/semantic-search-simp...


I made something similar, but used duckDB as the vector store (and query engine)! It’s impressively fast

https://github.com/patricktrainer/duckdb-embedding-search


I love duckdb, but their concurrency model is very limiting:

DuckDB has two configurable options for concurrency:

1. One process can both read and write to the database.

2. Multiple processes can read from the database, but no processes can write (access_mode = 'READ_ONLY').

https://duckdb.org/docs/connect/concurrency.html


Amy specific reason to use dDB?

I've got a crapload of json q & a formatted discussions on a topic, and am trying to figure out if I just store it somewhere and query it, or do I also do vector embeddings, kinda lost with all the possible options.


Embeddings are what encode the “meaning” of a given text. Similarity search works by computing the angle between your query vector and the rest of the vectors already stored. DuckDB (and columnar stores in general) is great at aggregation. It’s particularly well suited because DuckDB is a single file. There’s no server to muck with.


There is vector type data available in duckdb now?


They call it a fixed size array type but, yes. It was added earlier this year. Works really great

https://duckdb.org/2024/05/03/vector-similarity-search-vss.h...


Yep! It was added in v0.10.0 - which was released a month or two after I made this.

This is using v0.9.1


How does their embedding model compare in terms of retrieval accuracy to, say `text-embedding-3-small` and `text-embedding-3-large`?


You can use openai embeddings in elastic if you don't want to use their elser sparse embeddings


It’s impossible to answer that question without knowing what content/query domain you are embedding. Checkout MTEB leaderboard, dig into the retrieval benchmark, and look for analogous datasets.


So we're talking maximizing embedding model per use case? Medical dats would require differnet model than say sales data? Sounds very fragmented approach.


The answer lies with a validation dataset that you create for testing.



21 direct reports is far too many to be an effective manager. I currently manage 8 technical people and 8-10 seems to be the sweet spot, where I have time to do things like have 1-1s, help with career planning, get people promoted, deal with escalations or cross-team issues, hire replacements, etc.

I have had as many as 15 and after 10 you really just have to start picking what you want to fall off of your plate as a manager. Unfortunately, the things that fall off first are the things that help your direct reports the most. Eventually, you just get behind on everything and end up being completely reactive and only able to focus on short-term solutions.

21 direct reports are a lot if you want to be a good manager and support your people.


I think it is just leaving the bag for picking up on the way back. Not really that complicated.


Hey! Have you been reading my email? This is a perfect analogy of the medium-to-large business that I work for.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: