A Map for Studying Pre-training in LLMs
- Data Collection
- General Text Data
- Specialized Data
- Data Preprocessing
- Quality Filtering
- Deduplication
| #!/bin/bash | |
| # Modified 4/5/2019 | |
| Version=1.4 | |
| # Original source is from MigrateUserHomeToDomainAcct.sh | |
| # Written by Patrick Gallagher - https://twitter.com/patgmac | |
| # | |
| # Guidance and inspiration from Lisa Davies: | |
| # http://lisacherie.com/?p=239 | |
| # | |
| # Modified by Rich Trouton |
| #!/bin/bash | |
| # version 1.4 - Simon Andersen | |
| # set -x | |
| export PATH="/usr/local/bin:/usr/bin:/bin:/usr/sbin:/sbin" | |
| DSCONFIGADOUTPUT="$(dsconfigad -show -xml)" | |
| COMPUTERACCOUNT="$(/usr/libexec/PlistBuddy -c "Print ':General Info:Computer Account'" /dev/stdin <<<"$DSCONFIGADOUTPUT")" | |
| if [[ -z "$COMPUTERACCOUNT" ]]; then | |
| echo "Fatal error - No computer account name found" |
Some notes on AI Agent Rule / Instruction / Context files / etc.
| #!/bin/sh | |
| # halt execution immediately on failure | |
| # note there are some scenarios in which this will not exit; | |
| # see https://www.gnu.org/software/bash/manual/html_node/The-Set-Builtin.html | |
| # for additional details | |
| set -e | |
| is_ci= | |
| for i in "$@"; do |
A complete guide to set up a modern, powerful terminal environment for developers on macOS.
Perfetto is super useful for understanding interactions between the kernel and applications. Outside of Android and ChromeOS, though it's use isn't as common. This doc tries to provide a basic walk through to get started using perfetto for upstream kernel development with classic linux distros, potentially running under qemu.
Grab the latest linux- tarball: https://github.com/google/perfetto/releases
Often the tests I’m tracing need to run as root, so because of this, I copied the
binaries in the tarball to /usr/local/bin/ and chmod +x the binaries to make
This is a full guide for people who wanted to set up Windows 10/11 VM with QEMU/KVM hypervisor enhancements for a laptop that is configured with hybrid graphics card like Intel/AMD + NVIDIA. This process will take about 1 to 2 hours, depending on your system's performance and your patience =)
There is another comprehensive guide you can follow here (shoutout to asus-linux team who made supergfxctl which is a very important tool for this guide). It is more up-to-date than mine. I would probably incorporate those information into my guide, but you are welcome to use this one as a reference!