Hacker News new | past | comments | ask | show | jobs | submit login
Show HN: Ggml.js – Serverless AI Inference on Browser with Web Assembly (rahuldshetty.github.io)
4 points by anonymousd3vil on Aug 8, 2023 | hide | past | favorite | 2 comments
What is ggml.js?

Run any ggml ported ML models directly on your web browser with ggml.js

This project provides JavaScript bindings on ggml models, so that you can embed GGML models to your web apps to build serverless AI inferencing.

Currently the framework supports following model types: - Dolly v2 - GPT2 - GPT J - GPT NEO X - MPT - Replit - StarCoder

You can find more details about the framework: https://rahuldshetty.github.io/ggml.js/

For live demo examples, you can visit here: https://rahuldshetty.github.io/ggml.js-examples/

Source Code: https://github.com/rahuldshetty/ggml.js

Please do provide any feedback/questions/suggestions in the comments. I'll be happy to take it up :)




Can I run useful llms in the browser?


Yes, the support is limited at the moment. I am planning to port llama.cpp models in the future.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: