:shipit:
In a love-hate relationship with white spaces

Fernando MrCordeiro

@JoaoLages
JoaoLages / RLHF.md
Last active June 23, 2025 16:32
Reinforcement Learning from Human Feedback (RLHF) - a simplified explanation

Maybe you've heard about this technique but you haven't completely understood it, especially the PPO part. This explanation might help.

We will focus on text-to-text language models 📝, such as GPT-3, BLOOM, and T5. Models like BERT, which are encoder-only, are not addressed.

Reinforcement Learning from Human Feedback (RLHF) has been successfully applied in ChatGPT, hence its major increase in popularity. 📈

RLHF is especially useful in two scenarios 🌟:

  • You can’t create a good loss function
    • Example: how do you calculate a metric to measure if the model’s output was funny?
  • You want to train with production data, but you can’t easily label your production data
@averad
averad / Stable_Diffusion.md
Last active May 20, 2025 09:01 — forked from harishanand95/Stable_Diffusion.md
Stable Diffusion on AMD GPUs on Windows using DirectML
@Kavignon
Kavignon / fix-connection-speed-linux-wls2.md
Created December 1, 2021 22:02
Resolving a slow Wi-Fi connection speed over Linux WSL2

This problem may be caused by Windows acting as a bottleneck and limiting your connection speed. After some discussion with a colleague, this is the solution that worked in my case!

From Windows

  • Go to your Start Menu.
  • Go to 'View Network Connections'.
  • Right-click on your Wi-Fi adapter.
  • Select the 'Properties' menu option.
  • Select 'Internet Protocol Version 4' and click the 'Properties' button. (keep the IPv4 option checked)
  • Click on the 'Advanced' button.
  • Uncheck 'Automatic Metric' and enter 1 in the 'Interface metric' text box.
@csiebler
csiebler / private_repo.md
Last active February 9, 2022 13:48
Adding private Azure DevOps Artifact feeds to Azure Machine Learning

Steps:

  • Create private Feed in Azure DevOps
  • Create Personal Access Token (PAT) in Azure DevOps with Feed Read permission (details)
  • Navigate to the Azure DevOps Artifacts Feed page where you can see the details for the next steps (you'll need feed name, project name, organization name and later also package name): feed terminology
  • Create Build pipeline in Azure DevOps to create package and push to private feed:
trigger:
@gwmccubbin
gwmccubbin / DappToken.sol
Created March 4, 2020 16:31
DApp ERC-20 Token
pragma solidity ^0.5.0;
contract Token {
string public name = "DApp Token";
string public symbol = "DAPP";
uint256 public totalSupply = 1000000000000000000000000; // 1 million tokens
uint8 public decimals = 18;
event Transfer(
address indexed _from,
@yogthos
yogthos / clojure-beginner.md
Last active July 1, 2025 02:53
Clojure beginner resources

Introductory resources

@33eyes
33eyes / jupyter_notebook_w_nohup.md
Last active February 15, 2025 09:08
Jupyter notebook with nohup

Running jupyter notebook with nohup

This is helpful when running jupyter notebook on a remote server (AWS) and tunneling into it, because with nohup the notebook process keeps running on the server even if you get disconnected (any running notebook code continues without interruption, and you only need to re-establish the tunnel).

Start jupyter notebook on remote server

In the remote server terminal, run:

nohup jupyter notebook &
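
A minimal sketch of the same idea that also keeps a log and the process ID, so the notebook can be inspected or stopped later. The helper name `run_detached` and the file names `jupyter.log` and `jupyter.pid` are assumptions for illustration and are not part of the original gist.

```shell
# Hypothetical helper: run any command under nohup, send its output to a
# log file, and record its PID so it can be stopped later.
run_detached() {
    log_file=$1
    pid_file=$2
    shift 2
    # nohup keeps the command alive after the terminal (or SSH session) closes.
    nohup "$@" > "$log_file" 2>&1 &
    # $! is the PID of the most recent background job.
    echo $! > "$pid_file"
}

# On the remote server (assumes jupyter is installed and on PATH):
#   run_detached jupyter.log jupyter.pid jupyter notebook --no-browser --port=8888
# To follow the log:    tail -f jupyter.log
# To stop the notebook: kill "$(cat jupyter.pid)"
```

Writing the output to a named log file instead of the default `nohup.out` makes it easier to find the notebook URL and token after reconnecting.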
@linkesch
linkesch / .sh
Created December 17, 2015 10:45
Update remote branches on local machine
# http://kb.detlus.com/articles/git/how-to-update-remote-branch-list-on-local-machine/
git remote update origin --prune
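
To see what the command above does, here is a self-contained sketch that builds a throwaway "remote" and a clone in a temporary directory, deletes a branch on the remote, and then prunes. The branch names `main` and `stale-feature` and the demo identity are assumptions made up for this example.

```shell
set -eu

# Throwaway workspace, removed when the script exits.
work=$(mktemp -d)
trap 'rm -rf "$work"' EXIT

# A bare repository plays the role of the remote.
git init --quiet --bare "$work/remote.git"
git clone --quiet "$work/remote.git" "$work/local" 2>/dev/null
cd "$work/local"

# Create one commit and publish it as two remote branches.
git -c user.name=demo -c user.email=demo@example.com \
    commit --quiet --allow-empty -m "initial commit"
git push --quiet origin HEAD:refs/heads/main HEAD:refs/heads/stale-feature
git fetch --quiet origin

# Delete the branch on the remote side, behind the clone's back.
git --git-dir="$work/remote.git" update-ref -d refs/heads/stale-feature

# The clone still lists the deleted branch as a remote-tracking ref.
before=$(git branch -r)

# Fetch from origin and drop remote-tracking refs that no longer exist there.
git remote update origin --prune

after=$(git branch -r)

echo "before prune:"; echo "$before"
echo "after prune:";  echo "$after"
```

Only the remote-tracking reference (`origin/stale-feature`) is removed; any local branch of the same name would be left untouched.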
@dmglab
dmglab / git_bible.md
Last active March 9, 2024 02:59
how to git

Note: this is a summary of different git workflows put together into a small git bible. references are placed in between the text


How to Branch

try to keep your hacking out of the master and create feature branches. the [feature-branch workflow][4] is a good middle ground between noobs (i have no idea how to branch) and git veterans (let's do some rocket science with git branches!). everybody gets the idea!

Basic usage examples

@rootbear
rootbear / remove-last-row-from-table.html
Created August 2, 2012 15:17
Javascript: Remove Last Row from Table
<html>
<head>
<title>remove last row from a table</title>
<script language="JavaScript" type="text/javascript">
<!--
var rws;
function RemoveRow(obj){
obj=document.getElementById(obj);
rws=obj.getElementsByTagName('TR');
rws[rws.length-1].parentNode.removeChild(rws[rws.length-1]); // rows usually sit inside a TBODY, so remove via the row's own parent