Skip to content

Instantly share code, notes, and snippets.

View relyt0925's full-sized avatar

Tyler Lisowski relyt0925

View GitHub Profile
@relyt0925
relyt0925 / train_skills.jsonl
Created July 7, 2024 02:37
sdg train_skills.jsonl example
{"system": "You are an AI language model developed by IBM Research. You are a cautious assistant. You carefully follow instructions. You are helpful and harmless and you follow ethical guidelines and promote positive behavior.", "user": "Explain the formula of finding the area of a circle?", "assistant": "The formula for finding the area of a circle is πr², where π is a mathematical constant approximately equal to 3.14159 and r represents the radius of the circle. It is calculated as the product of π and the radius of the circle square. The radius of a circle is any line segment that passes through the center and terminates at the circumference of the circle. Let me know if you have any other questions or need assistance with something else."}
{"system": "You are an AI language model developed by IBM Research. You are a cautious assistant. You carefully follow instructions. You are helpful and harmless and you follow ethical guidelines and promote positive behavior.", "user": "How to find the area of a circle
@relyt0925
relyt0925 / test_model.jsonl
Created July 7, 2024 02:24
mt_bench test_model.jsonl example file
{"question_id": 81, "answer_id": "fTrzjmxaRqHjuA4uThwGG6", "model_id": "test_model", "choices": [{"index": 0, "turns": ["\nAloha, fellow wanderers! I've just returned from a tropical paradise that has left me with memories to last a lifetime - the enchanting islands of Hawaii. This archipelago in the Pacific Ocean is not only known for its stunning beaches and lush landscapes but also its rich culture and history.\n\nMy journey began on the island of Oahu, home to the bustling city of Honolulu. Here, I immersed myself in native Hawaiian culture at the Polynesian Cultural Center. The center showcases various Polynesian islands through interactive exhibits, traditional music, dance, and craft demonstrations. I participated in a hula lesson, learned about the art of lei-making, and even tried my hand at throwing a spear! \n\nNo trip to Oahu would be complete without visiting the historic Pearl Harbor. The USS Arizona Memorial was a humbling experience, providing an insight into the events leading up to America's
@relyt0925
relyt0925 / judge_model_single.jsonl
Last active July 7, 2024 02:22
mt_bench judge_model_single.jsonl example
{"question_id": 150, "model": "test_model", "judge": ["judge_model", "single-v1-multi-turn"], "user_prompt": "<|The Start of Assistant A's Conversation with User|>\n\n### User:\nHow have the Alps and Rhine River influenced settlement and agriculture in Western Europe? List three impacts.\n\n### Assistant A:\n\n1. Formation of distinct cultural regions: The Alps and the Rhine River have contributed to the formation of distinct cultural regions in Western Europe. The Alps, with their steep slopes and harsh climate, have traditionally been home to pastoral communities, while the fertile plains along the Rhine River have been ideal for agriculture. This has led to the development of unique cultural practices, languages, and cuisines in different regions.\n\n2. Trade and transportation: The Rhine River has been a crucial trade route since ancient times, connecting the Mediterranean region with the North Sea. Its navigability has facilitated the exchange of goods, ideas, and people between various parts of Western
@relyt0925
relyt0925 / question.jsonl
Created July 7, 2024 02:20
mt_bench_branch question.jsonl example
{"qna_file": "nowledge/technical_manual/ibm_redbooks/qna.yaml", "question_id": "133367776358949879785969799248364653018", "category": "taxonomy", "turns": ["What is the purpose of the Cross Applications - Situation Handling capability in SAP S/4HANA?"], "reference": ["The Cross Applications - Situation Handling capability in SAP S/4HANA allows specific groups of end users to be notified about important issues, defined as situations. This enables users to react immediately to the notifications they receive, improving overall efficiency and streamlining their workflow."]}
{"qna_file": "nowledge/technical_manual/ibm_redbooks/qna.yaml", "question_id": "174107797818490924604513529288460912465", "category": "taxonomy", "turns": ["How can you view the top 10 suppliers with the highest amounts payable in the Future Payables app?"], "reference": ["In the Future Payables app, you can view the top 10 suppliers with the highest amounts payable by navigating to the corresponding feature, which allows you to analyze the pa
@relyt0925
relyt0925 / judge_model.jsonl
Created July 7, 2024 02:18
mt_bench_branch judge_model.jsonl example file
{"question_id": "133367776358949879785969799248364653018", "answer_id": "N2UhZK6QumD66f7fXqoxbh", "model_id": "judge_model", "choices": [{"index": 0, "turns": ["The Cross Applications - Situation Handling capability in SAP S/4HANA allows specific groups of end users to be notified about important issues, defined as situations. This enables users to react immediately to the notifications they receive, improving overall efficiency and streamlining their workflow."]}], "tstamp": 1720310573.2916837}
{"question_id": "174107797818490924604513529288460912465", "answer_id": "eqT6a2fHUp32LtPPCBfSKB", "model_id": "judge_model", "choices": [{"index": 0, "turns": ["In the Future Payables app, you can view the top 10 suppliers with the highest amounts payable by navigating to the corresponding feature, which allows you to analyze the payables and check the numbers of open items for these suppliers."]}], "tstamp": 1720310573.2916996}
{"question_id": "251523899183662171254189060175926323095", "answer_id": "XnL6svCaaZyXAsJEV
@relyt0925
relyt0925 / base_test_model.jsonl
Created July 7, 2024 02:16
mt_bench_branch base_test_model.jsonl example
{"question_id": "101348486285295092973949491851774777550", "answer_id": "UBN58dT3Uo2r2aiQHNUQ5g", "model_id": "base_test_model", "choices": [{"index": 0, "turns": ["\nTo answer this question, we need to know the prices of both the Odyssey Ark 55-Inch and the Odyssey G55C 27-Inch monitors. Without this information, we cannot determine how many additional Odyssey G55C 27-Inch monitors could be afforded if the Odyssey Ark 55-Inch order was cancelled.\n\nHowever, I can show you how to calculate the answer once you have the prices. Here are the steps:\n\n1. Find out the price of the Odyssey Ark 55-Inch monitor and the Odyssey G55C 27-Inch monitor.\n2. Subtract the price of the Odyssey Ark 55-Inch monitor from the total amount spent on the order. This will give you the remaining budget for purchasing additional monitors.\n3. Divide the remaining budget by the price of the Odyssey G55C 27-Inch monitor. This will give you the number of additional monitors that can be afforded.\n\nSo the formula is:\n\nAdditional moni
@relyt0925
relyt0925 / test_model.jsonl
Created July 7, 2024 02:15
mt_bench_branch test_model.jsonl example
{"question_id": "101348486285295092973949491851774777550", "answer_id": "cLnJWfFDxrDSKqnWuvNTao", "model_id": "test_model", "choices": [{"index": 0, "turns": ["\nTo answer this question, we need to know the prices of both the Odyssey Ark 55-Inch and the Odyssey G55C 27-Inch monitors. Without this information, we cannot determine how many additional Odyssey G55C 27-Inch monitors could be afforded if the Odyssey Ark 55-Inch order was cancelled.\n\nIf you provide the prices, I can help you calculate the number of additional monitors that could be afforded."]}], "tstamp": 1720310372.793068}
{"question_id": "101615453773179268841614165828269868329", "answer_id": "MXnb7JBY9dNZoVVFnrbEjf", "model_id": "test_model", "choices": [{"index": 0, "turns": ["1. He plays soccer with his friends.\n2. She cooks dinner for us.\n3. I wear comfortable shoes.\n4. We often go to the library.\n5. They eat sandwiches for breakfast."]}], "tstamp": 1720310402.6988838}
{"question_id": "102616323836279432580264431649654588396", "answer_i
@relyt0925
relyt0925 / judge_model_single.jsonl
Created July 7, 2024 02:11
example file of mt_bench_branch model_judgment judge_model_single.jsonl
This file has been truncated, but you can view the full file.
{"question_id": "76341820815379879552305133803024133009", "model": "test_model", "judge": ["judge_model", "single-math-v1"], "user_prompt": "[Instruction]\nPlease act as an impartial judge and evaluate the quality of the response provided by an AI assistant to the user question displayed below. Your evaluation should consider correctness and helpfulness. You will be given a reference answer and the assistant's answer. Begin your evaluation by comparing the assistant's answer with the reference answer. Identify and correct any mistakes. If correct, an assistant's answer that follows a similar style of the reference answer is preferable. Do not bias to any particular style that does not appear in the reference answer. Be as objective as possible. After providing your explanation, you must rate the response on a scale of 1 to 10 by strictly following this format: \"[[rating]]\", for example: \"Rating: [[5]]\".\n\n[Question]\nJane runs faster than Joe. Joe runs faster than Sam. Does Sam run faster than Jane?\n\n\
apiVersion: v1
kind: Namespace
metadata:
name: tyler-vm
---
apiVersion: v1
kind: Service
metadata:
name: app-1
namespace: tyler-vm
apiVersion: v1
kind: Namespace
metadata:
name: tyler-vm
---
apiVersion: v1
kind: Service
metadata:
name: app-1
namespace: tyler-vm