tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-24 02:14:17 +00:00

History

JaSpa99 2fd7004980 Implementation of SoftVC VITS SVC model (#1371 ) * [WIP]: implementation of SoftVC VITS SVC model * fix typo * fix whitespace * Fully implement Generator & Synthesizer - implement SineGen & SourceHnNSF to reconstruct source signal from F0 - source signal is added during Generator - fix various typos - start loading state dict for synthesizer * Load Synthesizer weights - Fix typos in Synthesizer - Slightly modify vits::load_checkpoint to skip a specified layer - Test with Saul Goodman model because Drake weights are on mega * start work on ContentVec - implement ConvFeatureExtractionModel for ContentVec - start work on TransformerEncoder for ContentVec: - this transformer probably needs its own MultiheadAttention implementation - fix various typos in synthesizer - add helpers to mask behavior of ~ and % operator of torch * use normal and kaiming_normal * Implement ContentVec - load ContentVec weights and config from fairseq hyperparams - use MultiHeadAttention from whisper.py - TransformerSentenceEncoderLayer might still need some tweaking, will see during inference testing - redid tilde() - some cleanup * rename the file so it can be imported * forgot to lint * use float() instead of cast() * add contentvec256l9 and cleanup * Implement SoVITS fully and run it - Fully run sovits with .wav file - Drake weights need to be manually downloaded for now - Fix bugs - Add examples/sovits_helpers - Big TODO: INVALID Kernel for recordings > 4.5 secs * temp fix for longer audio recordings * Upsample no more torch * cleanup & detailed inference time measuring * Completely remove torch(audio) - Implement sinc resample in tinygrad - Load audio via Soundfile - Some cleanups * move stuff to helper files * Cleanup * fix invalid kernel * Cleanup & add more models * Metal sounds good after master merge - But Synthesizer pass became much slower * drake weights now marked save * do load/store in numpy * no commas needed here * remove extra newline * call Tensor::where on object * use Tensor::cat instead of numpy * pull out first iteration * remove Sequential, Dropout, GELU, TransposeLast * cast during loading * clean up attention * remove SamePad * Major cleanup / line reduction - Finish implementation of GroupNormMasked - Simplify parts of TransformerEncoder - Simplify parts of Generator - Move all helpers to common section - Only use repeat_expand_left for interp after SpeechEncoder - Moved SVC-specfic ContentVec impls up (canonically) - Proper annotations for get_encoder - Finished all TODOs - Squashed some whitespaces * clean up preprocess as well * more straightforward bool expr * add demo mode		2023-08-13 19:43:23 -07:00
..
mlperf	Fix naming conflict with huggingface datasets (#1161 )	2023-07-07 10:43:44 -07:00
sovits_helpers	Implementation of SoftVC VITS SVC model (#1371 )	2023-08-13 19:43:23 -07:00
vgg7_helpers	Renamed examples/yolo to examples/vgg7_helpers because that directory contains no yolo-related code and only helper code for vgg7. This was confusing to a new user when trying to understand the examples. (#1086 )	2023-07-01 12:04:28 -07:00
__init__.py	failing llama test	2023-03-11 16:28:10 -08:00
benchmark_train_efficientnet.py	examples: numpy() array returns only one value, not an array (#1534 )	2023-08-13 14:33:05 -07:00
compile_efficientnet.py	simple exporting models (#1344 )	2023-08-01 09:35:48 -07:00
compile_tensorflow.py	moved extras/jit.py -> tinygrad/jit.py (#599 )	2023-02-25 08:32:33 -08:00
deep_deterministic_policy_gradient.py	Add pylint trailing whitespace rule (#1314 )	2023-07-21 13:37:55 -04:00
efficientnet.py	Fix plt output comment (#1428 )	2023-08-03 23:35:52 -07:00
gpt2.py	add GPT2 example (#1511 ) (#1514 )	2023-08-10 09:09:47 -07:00
hlb_cifar10.py	CIFAR 94.03% (#1340 )	2023-08-08 15:13:24 -07:00
hlb_cifar10_torch.py	Fix naming conflict with huggingface datasets (#1161 )	2023-07-07 10:43:44 -07:00
index.html	simple exporting models (#1344 )	2023-08-01 09:35:48 -07:00
llama.py	Tensor.scaled_dot_product_attention to match torch, used in LLaMA, and tested (#1502 )	2023-08-08 23:27:13 -07:00
mask_rcnn.py	MaskRCNN Inference (#884 )	2023-06-25 15:37:51 -07:00
mnist_gan.py	Fix discriminator balancing in mnist_gan example (#1332 )	2023-07-23 12:43:05 -07:00
serious_mnist.py	Fix naming conflict with huggingface datasets (#1161 )	2023-07-07 10:43:44 -07:00
simple_conv_bn.py	examples: simple conv bn	2023-07-04 13:50:26 -07:00
so_vits_svc.py	Implementation of SoftVC VITS SVC model (#1371 )	2023-08-13 19:43:23 -07:00
stable_diffusion.py	[New] SD: Refactor AttnBlock, CrossAttention, CLIPAttention to share code (#1516 ) (#1518 )	2023-08-10 15:04:18 -07:00
train_efficientnet.py	Fix naming conflict with huggingface datasets (#1161 )	2023-07-07 10:43:44 -07:00
train_resnet.py	Fix naming conflict with huggingface datasets (#1161 )	2023-07-07 10:43:44 -07:00
transformer.py	fix imports for examples/transformer.py (#1136 )	2023-07-05 08:15:13 -07:00
vgg7.py	Renamed examples/yolo to examples/vgg7_helpers because that directory contains no yolo-related code and only helper code for vgg7. This was confusing to a new user when trying to understand the examples. (#1086 )	2023-07-01 12:04:28 -07:00
vit.py	Remove Tensor.data (#565 )	2023-02-18 16:36:12 -08:00
vits.py	Implementation of SoftVC VITS SVC model (#1371 )	2023-08-13 19:43:23 -07:00
whisper.py	Removed dep of torch, torchaudio, kept librosa only (#1264 )	2023-08-02 13:52:04 -04:00
yolov3.py	Permute examples (#731 )	2023-03-29 05:07:06 +04:00
yolov8-onnx.py	Add pylint trailing whitespace rule (#1314 )	2023-07-21 13:37:55 -04:00
yolov8.py	Add pylint trailing whitespace rule (#1314 )	2023-07-21 13:37:55 -04:00