Hacker News new | past | comments | ask | show | jobs | submit login

How about going one step further and constrain transformer output with a context-free grammar? That way you can generate more conformant code such as Python or C.



This may be possible as constraints using constrained beam search, which huggingface has quietly supported for a long time.


Wouldn't even need to beam search if you restrict it to deterministic context free grammars, which would satisfy > 95% of these "generate some JSON schema" use-cases. For DCFGs you can just zero-out the probability for any token that is invalid in the context, no lookahead or search needed. Wouldn't work for truly context free things like most programming languages, though.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: