This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| MODEL_TO_USE = "NanoV2" # or "MistralNemo" | |
| IMPORT_MODEL_AND_DATA = False # set to True to import the model from HF hub, only needs to be run once | |
| REASONING_STYLE = "gpt_oss_120b" # or DeepSeek_R1_0528 | |
| NUM_NODES = 2 | |
| GPUS_PER_NODE = 8 | |
| import nemo_run as run | |
| from nemo.collections import llm | |
| from nemo.collections.llm.gpt.model.mistral import MistralModel, MistralNeMoConfig12B | |
| from nemo.collections.llm.gpt.model.ssm import MambaModel, NemotronNano12Bv2 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| If you're building Q and getting metalava errors apply: | |
| https://github.com/Magma-WIP/build_soong/commit/bcd1bb529132905cf55e72f5a2a6ba19a99f60ac | |
| https://github.com/Magma-WIP/build_soong/commit/dc3365fbde3b2a5773e655f690bb073967100795 | |
| If build fails even with the commits applied try building SystemUI with -j1 | |
| It can also be a good idea to increase swap | |
| Command: | |
| cd build/soong && git fetch https://github.com/Magma-WIP/build_soong ten-metalava && git cherry-pick bcd1bb529132905cf55e72f5a2a6ba19a99f60ac^..dc3365fbde3b2a5773e655f690bb073967100795 |