(by @andrestaltz)
If you prefer to watch video tutorials with live-coding, then check out this series I recorded with the same contents as in this article: Egghead.io - Introduction to Reactive Programming.
Binding | Action |
---|---|
<localleader>cd | Disconnect from the current nREPL server. |
<localleader>cf | Connect to a server using the port found in any of the files specified by clojure.nrepl/connection.port-files. |
<localleader>sc | Clone the current nREPL session. |
<localleader>sf | Create a fresh session from scratch. |
<localleader>sq | Close the current session. |
<localleader>sQ | Close all sessions. |
<localleader>sl | List all sessions. |
Good question! I am collecting human data on how quantization affects outputs. See here for more information: ggml-org/llama.cpp#5962
In the meantime, use the largest quantization that fully fits in your GPU's memory. If you can comfortably fit Q4_K_S, try using a model with more parameters.
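If it helps, here is a rough Python sketch of that rule of thumb: given the file sizes of the quantizations on offer and the amount of GPU memory, pick the largest one that still fits. The function name `pick_largest_quant`, the `headroom_gb` parameter (memory reserved for context and other overhead), and the sizes in the example are all assumptions made up for illustration, not measured values or anything from llama.cpp itself.

```python
from typing import Dict, Optional


def pick_largest_quant(file_sizes_gb: Dict[str, float],
                       vram_gb: float,
                       headroom_gb: float = 1.0) -> Optional[str]:
    """Return the name of the largest quantization that fits in GPU memory.

    headroom_gb is a hypothetical safety margin for context and other
    overhead; adjust it to your own setup.
    """
    budget = vram_gb - headroom_gb
    # Keep only the quantizations whose file size fits within the budget.
    fitting = {name: size for name, size in file_sizes_gb.items() if size <= budget}
    if not fitting:
        return None  # Nothing fits fully; consider a smaller model.
    # The largest file that fits is the least aggressively quantized option.
    return max(fitting, key=fitting.get)


# Made-up sizes for illustration only; read the real ones off the download page.
example_sizes = {"Q4_K_S": 4.1, "Q5_K_M": 5.1, "Q8_0": 7.7}
choice = pick_largest_quant(example_sizes, vram_gb=8.0)
print(choice)
```

If the result is a quantization well above Q4_K_S with room to spare, that is the cue to look at a model with more parameters instead.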
See the wiki upstream: https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix