TSH code sharing tsh-code

tsh.io code-sharing account

tsh-code / requirements.txt

Created May 6, 2024 09:34

tsh-code / Dockerfile

Created May 6, 2024 09:32

	FROM amazon/aws-lambda-python:3.9

	COPY ./requirements.txt .

	RUN yum -y install gcc-c++
	RUN pip install --no-cache-dir torch --extra-index-url https://download.pytorch.org/whl/cpu
	RUN pip install --no-cache-dir -r requirements.txt

	ENV HOME /tmp

tsh-code / prompt.txt

Last active July 10, 2025 11:39

	Your task is to find unique individuals in the given text. As a result, return an array of objects in the given format:

	[
	{
	firstName: string,
	lastName: string,
	presumedGender: "male" \| "female" \| "unknown"
	}
	]

tsh-code / gpt-4-body.json

Created March 6, 2024 11:21

gpt 4 body

	{
	"model":"gpt-4",
	"messages":[
	{
	"role":"system",
	"content":"Your task is to find unique individuals in the given text. As a result return an array of objects in given format: [{firstName: string, lastName: string, presumedGender: 'male' \| 'female' \| 'unknown'}] As an answer I expect only an array of objects."
	},
	{
	"role":"user",
	"content":"${textGoesHere}"

tsh-code / openai-response-1.json

Created March 6, 2024 11:20

openai response

tsh-code / compromise-response-2.json

Last active April 4, 2024 10:17

compromise response 2

tsh-code / compromise-response-1.json

Last active April 4, 2024 10:18

compromise response 1

tsh-code / spacy-spanmarker-final.py

Created March 4, 2024 08:27

final solution

	from flask import Flask, request
	from datasets import load_dataset, Dataset
	import json
	from nltk.tokenize import sent_tokenize, word_tokenize

	nlp = spacy.load("en_core_web_trf")
	nlp.add_pipe("span_marker",config={"model": "lxyuan/span-marker-bert-base-multilingual-cased-multinerd"})

	app = Flask(__name__)

tsh-code / spacy-spanmarker.py

Created March 4, 2024 08:26

spacy spanmarker example

	import spacy

	nlp = spacy.load("en_core_web_trf")
	nlp.add_pipe("span_marker", config={"model": "lxyuan/span-marker-bert-base-multilingual-cased-multinerd"})

	def extract_people(text: str):
	entities = nlp(text)
	full_names = set()

	for entity in entities.ents:

tsh-code / spanmarker.py

Created March 4, 2024 08:25

span marker example

	from span_marker import SpanMarkerModel
	modelPreTrained = SpanMarkerModel.from_pretrained("tomaarsen/span-marker-mbert-base-multinerd")
	modelPreTrained.try_cuda()

	def extract_people(text:str):
	entities = modelPreTrained.predict(text)
	full_names = set()
	for entity in entities.ents:
	if entity['label'] == 'PER':
	# Check if the entity has both a first name and a last name