Last active
January 11, 2024 21:57
-
-
Save ZHAOZHIHAO/7004469df04ce2f568a68278d40bd1a5 to your computer and use it in GitHub Desktop.
new things
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
ChatGPT: | |
introduction https://www.semrush.com/blog/how-does-chatgpt-work/# | |
Model quantization: | |
Basics | |
A White Paper on Neural Network Quantization | |
how to implement the quantization aware training from scratch | |
"EFFICIENT QUANTIZATION-AWARE TRAINING WITH ADAPTIVE CORESET SELECTION" | |
how to implement post training quantization training from scratch using pytorch | |
https://jermmy.github.io/2020/07/04/2020-7-4-post-training-quantization-2/ | |
torch.round gradient | |
https://discuss.pytorch.org/t/torch-round-gradient/28628/6 | |
https://www.reddit.com/r/pytorch/comments/lqi315/how_does_quantize_per_tensor_work_in_relation/ | |
why doesn't pytorch support quantized model inference on GPU? | |
https://stackoverflow.com/questions/69718379/running-pytorch-quantized-model-on-cuda-gpu |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment