Skip to content

Instantly share code, notes, and snippets.

View HAKSOAT's full-sized avatar
🏗️
Building information retrieval systems...

Habeeb Shopeju HAKSOAT

🏗️
Building information retrieval systems...
View GitHub Profile
{'article_pid': 18951386, 'wp_templates': ['WikiProject Objectivism', 'WikiProject Novels',
'WikiProject Philosophy', 'WikiProject Libertarianism', 'WikiProject Politics',
'WikiProject Trains'], 'title': 'Atlas Shrugged', 'talk_revid': 911346471,
'taxo_labels': ['Culture.Philosophy and religion', 'Culture.Media.Media*', 'Culture.Media.Books',
'Culture.Literature', 'Culture.Philosophy and religion', 'History and Society.Politics and government',
'History and Society.Politics and government', 'History and Society.Transportation'],
'article_revid': 926765055,
'sitelinks': {'ru': 'Атлант расправил плечи', 'cs': 'Atlasova vzpoura',
'hy': 'Ատլանտը պարզեց թևերը', 'da': 'Og verden skælvede', 'ky': 'Атлант ийиндерин куушурду',
'de': 'Atlas wirft die Welt ab', 'simple': 'Atlas Shrugged', 'sv': 'Och världen skälvde',
On 8 Jne 1954, Turing's housekeeper found him dead at the age of 41; he had died the previous day.
HAKS
HEKS
HIKS
HOKS
HUKS
HJKS
H1KS
@HAKSOAT
HAKSOAT / SearchEngineer.md
Created July 17, 2021 22:11 — forked from morria/SearchEngineer.md
Search Engineer

Search Relevance Engineer

Working with the Search team, you'll be applying your background in Information Retrieval, Machine Learning or Data Mining to run experiments and develop products that have a provable impact on the Etsy marketplace. You'll be analyzing data, understanding language, developing new algorithms and building large-scale distributed systems.

Our team is responsible for creating and optimizing the best experiences for buyers and getting the best performance for sellers. Our work focuses on improvements to search ranking, query understanding, spelling correction, auto completion and query intent recognition.

Requirements

  • Strong background in Machine Learning, Statistics, Information Retrieval
import os
from asyncio import create_subprocess_shell, subprocess as aio_subprocess
from datetime import timedelta
SAVE_FORMAT = "{title}-{start}-{duration}.{extension}"
async def aio_exec(command):
@HAKSOAT
HAKSOAT / text_preprocessing.py
Created April 15, 2023 10:47 — forked from jiahao87/text_preprocessing.py
Full code for preprocessing text
from bs4 import BeautifulSoup
import spacy
import unidecode
from word2number import w2n
import contractions
nlp = spacy.load('en_core_web_md')
# exclude words from spacy stopwords list
deselect_stop_words = ['no', 'not']
@HAKSOAT
HAKSOAT / prompt.txt
Created August 29, 2025 20:45
BGE-M3-Extended-Naija
I have a task for you, I need you to build a search interface where there is a search bar that takes in a yoruba query, then on clicking search will return the top 10 results.
Each result is a verse from a Qur'an Surah.
I need you to parse the verses 1 to 135 from the files:
https://github.com/Niger-Volta-LTI/yoruba-text/blob/master/Quran_Mimo/Whole_Yoruba_Quran_0479.txt
The verses are in the files from 0462 to but not including 0479.
"""
Custom SAC implementation with trainable weights for multiple actor losses.
This example demonstrates how to implement trainable loss weights in Stable-Baselines3
SAC when you have multiple actor losses that need to be balanced.
Use case:
- Critic loss: Fixed (not weighted)
- Actor loss 1: Trainable weight w1
- Actor loss 2: Trainable weight w2