Andrew Kornilov (frutik)
Machine Learning Scientist II - Content Intelligence
About the job
At Booking.com, data drives our decisions. Technology is at our core and innovation is everywhere. But our company is more than datasets, lines of code or A/B tests. We’re the thrill of the first night in a new place. The excitement of the next morning. The friends you make. The journeys you take. The sights you see. And the food you sample. Through our products, partners and people, we can empower everyone to experience the world.
The Content Intelligence team builds the Content Intelligence Platform by consuming millions of images and textual inputs every day and enriching them with ML capabilities. These enrichments then serve downstream applications and personalize our customers' experience (think of choosing and surfacing the right images and reviews when customers book their next vacation).
Moreover, the team plays a key role in building in-house LLMs for needs such as moderation, translation, and an AI trip-planner chatbot.
/**
* Levenshtein distance between two strings.
* - O(m*n) time
* - O(min(m,n)) memory
*/
function levenshtein(a, b) {
  if (a === b) return 0;
  if (!a) return b.length;
  if (!b) return a.length;
  if (a.length < b.length) [a, b] = [b, a]; // keep the DP row over the shorter string
  let prev = Array.from({ length: b.length + 1 }, (_, i) => i);
  for (let i = 1; i <= a.length; i++) {
    const cur = [i];
    for (let j = 1; j <= b.length; j++)
      cur[j] = Math.min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (a[i - 1] === b[j - 1] ? 0 : 1));
    prev = cur;
  }
  return prev[b.length];
}
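For example, 'kitten' and 'sitting' differ by two substitutions and one insertion:

console.log(levenshtein('kitten', 'sitting')); // 3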
Performance implications of medium size values and TOAST in Postgres and how to mitigate them - https://pganalyze.com/blog/5mins-postgres-TOAST-performance
The Surprising Impact of Medium-Size Texts on PostgreSQL Performance. Why TOAST is the best thing since sliced bread - https://hakibenita.com/sql-medium-text-performance
Speed up your queries by avoiding to hit the TOAST - https://medium.com/@walttonm/speed-up-your-queries-by-avoiding-to-hit-the-toast-a71feaaeacc2
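One quick way to check whether medium-size values are actually landing in TOAST is to compare a table's main size with its TOAST relation size; a minimal sketch (the table name reviews is just an example):

SELECT c.relname,
       pg_size_pretty(pg_relation_size(c.oid)) AS main_size,
       pg_size_pretty(pg_relation_size(c.reltoastrelid)) AS toast_size
FROM pg_class c
WHERE c.relname = 'reviews'
  AND c.reltoastrelid <> 0;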
from mlx_lm import load, generate
model, tokenizer = load('Qwen/Qwen2-7B-Instruct-MLX', tokenizer_config={"eos_token": "<|im_end|>"})
prompt = "Why people call putin khuilo."
messages = [
    {"role": "system", "content": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
# render the chat template and generate a completion
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
response = generate(model, tokenizer, prompt=text, verbose=True)
frutik / hamming_dist.sql
Created November 20, 2024 20:03 — forked from MNoorFawi/hamming_dist.sql
Hamming Distance in PostgreSQL Database
CREATE OR REPLACE FUNCTION hamming_distance(
A0 bigint, A1 bigint, A2 bigint, A3 bigint,
B0 bigint, B1 bigint, B2 bigint, B3 bigint
)
RETURNS integer AS $$
BEGIN
    RETURN
        bits_count(A0 # B0) +
        bits_count(A1 # B1) +
        bits_count(A2 # B2) +
        bits_count(A3 # B3);
END;
$$ LANGUAGE plpgsql;
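The function relies on a bits_count() popcount helper that the preview does not show; one possible definition (the forked gist may define it differently):

CREATE OR REPLACE FUNCTION bits_count(value bigint)
RETURNS integer AS $$
BEGIN
    -- cast to a 64-bit string of 0s and 1s and count the 1s
    RETURN length(replace(value::bit(64)::text, '0', ''));
END;
$$ LANGUAGE plpgsql IMMUTABLE;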
frutik / bloated_indices
Last active October 22, 2024 20:48
Useful PostgreSQL queries for when your Postgres goes crazy
SELECT
c.relname AS index_name,
pg_size_pretty(pg_relation_size(c.oid)) AS index_size,
pg_size_pretty(pg_total_relation_size(c.oid) - pg_relation_size(c.oid)) AS index_bloat_size,
ROUND((pg_total_relation_size(c.oid) - pg_relation_size(c.oid)) / pg_relation_size(c.oid)::numeric * 100, 2) AS bloat_percentage
FROM
pg_class c
JOIN
pg_namespace n ON c.relnamespace = n.oid
WHERE
    c.relkind = 'i'
    AND n.nspname NOT IN ('pg_catalog', 'information_schema')
    AND pg_relation_size(c.oid) > 0
ORDER BY
    bloat_percentage DESC;
import onnxruntime as ort
from transformers import AutoTokenizer
session = ort.InferenceSession('./bge-small-en/model.onnx')
tokenizer = AutoTokenizer.from_pretrained("./bge-small-en")
inputs = tokenizer("hello world.", padding="longest", return_tensors="np")
inputs_onnx = {key: ort.OrtValue.ortvalue_from_numpy(value) for key, value in inputs.items()}
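The preview stops before the actual inference call. A minimal sketch of running the session and pooling the output, assuming the exported bge-small-en model accepts the tokenizer's input names and returns the last hidden state as its first output:

# Plain numpy inputs work with session.run(); the OrtValue dict above would go through run_with_ort_values() instead.
outputs = session.run(None, dict(inputs))
last_hidden_state = outputs[0]        # shape: (batch, seq_len, hidden)
embedding = last_hidden_state[:, 0]   # CLS-token pooling, the usual choice for BGE models
print(embedding.shape)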
https://brandur.org/fragments/postgres-partitioning-2022
https://pganalyze.com/blog/postgresql-partitioning-django
https://django-postgres-extra.readthedocs.io/en/master/table_partitioning.html
https://hevodata.com/learn/postgresql-partitions/
https://www.2ndquadrant.com/en/blog/postgresql-12-foreign-keys-and-partitioned-tables/
https://www.postgresql.org/docs/current/ddl-partitioning.html#DDL-PARTITIONING-DECLARATIVE-LIMITATIONS
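For reference, the declarative partitioning these links cover boils down to a parent table with a partition key plus per-range children; a minimal sketch (table and column names are made up for illustration):

CREATE TABLE events (
    id         bigserial,
    created_at timestamptz NOT NULL,
    payload    jsonb,
    PRIMARY KEY (id, created_at)  -- unique constraints must include the partition key
) PARTITION BY RANGE (created_at);

CREATE TABLE events_2024_q4 PARTITION OF events
    FOR VALUES FROM ('2024-10-01') TO ('2025-01-01');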
def divide_chunks(l, n):
    for i in range(0, len(l), n):
        yield l[i:i + n]

a = 'First sentence. Second sentence. Third sentence. Fourth sentence.'  # placeholder text; the original sample string is not shown
b = [i.strip() for i in a.split('.') if i.strip()]
c = list(divide_chunks(b, 3))
d = ['. '.join(i + ['']).strip() for i in c]
y = '\n\n'.join(d)
print(y)
kubectl get --raw "/apis/metrics.k8s.io/v1beta1/nodes" -v=8 | python -m json.tool