I wrote a simple library to reduce latency in voice generation from LLM chat completion streams.
It lets you generate speech from streaming text produced by local LLMs such as Ollama, using local TTS clients such as Apple Say or external ones such as Google Text-to-Speech, with latency comparable to proprietary assistants such as OpenAI's.
As each sentence boundary is detected, the library runs TTS on that sentence and plays it aloud while the rest of the completion is still being generated in the background.
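Conceptually, the core loop buffers the streamed completion, splits off any complete sentences, and hands each one to a TTS backend as soon as it is ready. Here is a minimal sketch of that idea, assuming the Ollama Python client and macOS's `say` command as stand-ins for whichever LLM and TTS backends you actually use:

```python
# Minimal sketch: buffer a streamed chat completion, detect sentence
# boundaries, and speak each sentence as soon as it is complete.
# The model name, Ollama client, and `say` command are assumptions.
import re
import subprocess

import ollama  # assumed local Ollama client

SENTENCE_END = re.compile(r"(?<=[.!?])\s+")

def speak(sentence: str) -> None:
    # macOS `say` as a stand-in for any local or remote TTS client
    subprocess.run(["say", sentence])

def stream_and_speak(prompt: str, model: str = "llama3") -> None:
    buffer = ""
    for chunk in ollama.chat(model=model,
                             messages=[{"role": "user", "content": prompt}],
                             stream=True):
        buffer += chunk["message"]["content"]
        # Split off complete sentences and speak them immediately;
        # the server keeps generating while audio plays.
        parts = SENTENCE_END.split(buffer)
        for sentence in parts[:-1]:
            speak(sentence.strip())
        buffer = parts[-1]
    if buffer.strip():
        speak(buffer.strip())

if __name__ == "__main__":
    stream_and_speak("Tell me a short story about a lighthouse keeper.")
```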
Developed an open-source voice assistant that combines OpenAI's Whisper, Chat Completion, and Voice Generation APIs into a complete spoken assistant.
Some potential extensions include integrating it into custom hardware or adding function calling to expand the default capabilities.
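For reference, a single turn of the assistant (listen, think, speak) maps onto the three APIs roughly like this. This is a minimal sketch assuming the OpenAI Python SDK (v1) and a pre-recorded audio clip; the model names, output file, and `afplay` playback call are assumptions, not the project's exact code:

```python
# One assistant turn: transcribe a question, get a chat reply, speak it.
import subprocess
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def assistant_turn(audio_path: str) -> None:
    # 1. Speech to text with Whisper
    with open(audio_path, "rb") as f:
        transcript = client.audio.transcriptions.create(model="whisper-1", file=f)

    # 2. Chat completion on the transcribed question
    chat = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model name
        messages=[{"role": "user", "content": transcript.text}],
    )
    reply = chat.choices[0].message.content

    # 3. Text to speech on the reply
    speech = client.audio.speech.create(model="tts-1", voice="alloy", input=reply)
    with open("reply.mp3", "wb") as out:
        out.write(speech.content)

    # Play the reply (afplay on macOS; swap in your platform's player)
    subprocess.run(["afplay", "reply.mp3"])

if __name__ == "__main__":
    assistant_turn("question.wav")
```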
Simple experiment in question answering over YouTube videos using embeddings and the transcripts of the top-n YouTube search results.
It takes a question and, optionally, a YouTube search query (otherwise an LLM auto-generates one), compiles the transcript of each video result, builds an embedding index from those transcripts, and then answers the question using the most relevant embeddings.
It returns both a string response and a list of the sources used for the answer.
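A rough sketch of how that pipeline can be wired together, assuming OpenAI embeddings, the youtube-transcript-api package, and a list of video IDs already obtained from the YouTube search; the search step and chunking are simplified here:

```python
# Retrieval-augmented answering over YouTube transcripts (simplified).
import numpy as np
from openai import OpenAI
from youtube_transcript_api import YouTubeTranscriptApi

client = OpenAI()

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

def answer(question: str, video_ids: list[str], top_k: int = 5) -> tuple[str, list[str]]:
    # Build (chunk, source) pairs from each video's transcript
    chunks, sources = [], []
    for vid in video_ids:
        # Classic API; newer youtube-transcript-api versions use
        # YouTubeTranscriptApi().fetch(vid) instead.
        segments = YouTubeTranscriptApi.get_transcript(vid)
        text = " ".join(seg["text"] for seg in segments)
        for i in range(0, len(text), 1000):
            chunks.append(text[i:i + 1000])
            sources.append(f"https://youtube.com/watch?v={vid}")

    # Rank chunks by cosine similarity to the question
    chunk_vecs, q_vec = embed(chunks), embed([question])[0]
    sims = chunk_vecs @ q_vec / (np.linalg.norm(chunk_vecs, axis=1) * np.linalg.norm(q_vec))
    top = np.argsort(sims)[::-1][:top_k]

    # Answer from the retrieved context only
    context = "\n\n".join(chunks[i] for i in top)
    reply = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model name
        messages=[{"role": "user",
                   "content": f"Answer using only this context:\n{context}\n\nQuestion: {question}"}],
    )
    return reply.choices[0].message.content, sorted(set(sources[i] for i in top))
```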
Introducing our Video Summarization API — a game-changing tool that leverages advanced language models to summarize any YouTube video, no matter the length. Similar in technology to OpenAI's ChatGPT, our API distills key points and themes from videos, offering a quick way to grasp content without watching it in entirety. Ideal for content creators, researchers, and anyone who wants to consume video content more efficiently.
Can be easily integrated into Apple Shortcuts to summarize YouTube videos on the go (will publish an example soon).
The API is currently in beta; feel free to leave comments and feedback so I can improve it.
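To give a rough idea of how calling such an API could look, here is a hypothetical request; the endpoint URL, parameter name, and response field below are placeholders, not the published interface:

```python
# Hypothetical usage sketch; everything below is a placeholder.
import requests

API_URL = "https://example.com/summarize"  # placeholder endpoint

resp = requests.get(
    API_URL,
    params={"url": "https://www.youtube.com/watch?v=VIDEO_ID"},  # placeholder video
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["summary"])  # assumed response field
```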
OpenAI's GPT-3 seems to be better than Google and other smart home assistants, so I wanted to make my own by wrapping the GPT-3 API in voice recognition and text-to-speech.
I wrote a short script that recognizes voice input from the computer's microphone, sends the transcribed text to OpenAI's GPT-3, and speaks the response over your speakers.
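A minimal sketch of that loop, assuming the SpeechRecognition and pyttsx3 packages plus the OpenAI SDK; the exact model and TTS engine in the original script may differ, and a current completion model stands in for GPT-3 here:

```python
# Listen on the microphone, send the text to a completion model, speak the reply.
import pyttsx3
import speech_recognition as sr
from openai import OpenAI

client = OpenAI()
recognizer = sr.Recognizer()
tts = pyttsx3.init()

def listen_and_reply() -> None:
    # Capture a spoken question from the default microphone
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    question = recognizer.recognize_google(audio)

    # Send the transcribed text to a completion model
    resp = client.completions.create(
        model="gpt-3.5-turbo-instruct",  # stand-in for the original GPT-3 model
        prompt=question,
        max_tokens=200,
    )

    # Speak the answer over the speakers
    tts.say(resp.choices[0].text.strip())
    tts.runAndWait()

if __name__ == "__main__":
    listen_and_reply()
```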