mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-06-24 02:14:17 +00:00
* feat: working voice 2 text using whisper * feat: added llama generation * feat: vits init * feat: more accurate voice conversion * feat: support for tts and working pipeline for the first pass * fix: linter checks * refactored vits initialization and inference, added mmts-tts support * fixed process sync and now we can have an infinite conversation * reuse output stream to remove overhead of creating a new one each time * added pre-prompt configuration with yaml files * adjusted code to merge PR which changed whisper * optimized whisper, now it's blazing fast and also reduced number of lines * added better debug printing * use jitted encode function for whisper, added timings and removed response delim to save speed on generating those tokens * fixed hf convert and now it's working with tinyllama * added tinyllama config * refactored code and made it work with all llama models * prettier order * prettier order * fixed suffix for tinyllama and refactored convert_from_hf * added missing parameters * fixed stream release and added missing params * jitted dp and encoder * jitted flow forward * removed re-init of espeak on each call to save up time * jitted generator forward for blazing fast tts * added contextmanager for displaying a chat log * removed whitespace for pylint * updated code to support latest fetch func * wait for llama eos token and pass params from cli to llama * listen for not fixed amount of time * refactored code a bit * removed thresholding and now the output streams directly to whisper * tokenize llama output for vits batch size to work and stream each sentence to a speaker * changed speaker * whisper is now printing on the same line * don't trigger llama on whisper output in parens * added tinyllama chat model * adjusted code to work with tinyllama chat model * removed unused cli arg * autofetch tokenizer and tinyllama model. add 3 chat tokens to the tokenizer * fixed issue with long sentences by chunking them * support for multiline llama output * prettified log output * adjusted sentence length * remove quote from response to avoid funny tts * fixed prompts * added missing parameter |
||
|---|---|---|
| .. | ||
| conversation_data | ||
| mlperf | ||
| sovits_helpers | ||
| vgg7_helpers | ||
| webgpu/stable_diffusion | ||
| __init__.py | ||
| beautiful_cartpole.py | ||
| beautiful_mnist.py | ||
| benchmark_train_efficientnet.py | ||
| coder.py | ||
| compile_efficientnet.py | ||
| compile_tensorflow.py | ||
| conversation.py | ||
| efficientnet.py | ||
| f16_w_uint32.py | ||
| gpt2.py | ||
| handcode_resnet50_opt.py | ||
| hlb_cifar10.py | ||
| index.html | ||
| llama.py | ||
| mask_rcnn.py | ||
| mnist_gan.py | ||
| serious_mnist.py | ||
| simple_conv_bn.py | ||
| so_vits_svc.py | ||
| stable_diffusion.py | ||
| train_efficientnet.py | ||
| train_resnet.py | ||
| transformer.py | ||
| vgg7.py | ||
| vit.py | ||
| vits.py | ||
| whisper.py | ||
| yolov3.py | ||
| yolov8-onnx.py | ||
| yolov8.py | ||