Skip to content

Instantly share code, notes, and snippets.

View Witiko's full-sized avatar

Vít Starý Novotný Witiko

  • Brno, Czech Republic
View GitHub Profile
{
"embeddings": [
{
"tensorName": "The soft VSM with non-regularized word embeddings on the BBCSPORT dataset",
"tensorShape": [
737,
3
],
"tensorPath": "https://gist.githubusercontent.com/Witiko/faed3553a6eeed93edc9f592924620b7/raw/c21c12dc88807d32e5a79dae3644ca7f0c429b77/docsim-dense_scm-bbcsport-1-True-True-True-100--1.0-2-vectors.csv",
"metadataPath": "https://gist.githubusercontent.com/Witiko/faed3553a6eeed93edc9f592924620b7/raw/c21c12dc88807d32e5a79dae3644ca7f0c429b77/docsim-dense_scm-bbcsport-1-True-True-True-100--1.0-2-metadata.csv"
We can make this file beautiful and searchable if this error is corrected: No commas found in this CSV file in line 0.
-6.042105197906494 0.056784726679325104 1.9964343309402466
-1.623029112815857 4.63659143447876 6.253110408782959
1.66807222366333 3.268333911895752 -2.813706398010254
1.075560450553894 2.8373401165008545 4.059364318847656
2.2727134227752686 -3.114814281463623 -8.626872062683105
-11.0668306350708 1.0132794380187988 7.732623100280762
-3.431757926940918 -16.058778762817383 1.516681432723999
6.394355773925781 -12.617380142211914 3.0361790657043457
5.27213716506958 1.37405264377594 -14.297883033752441
-6.678133487701416 6.447080612182617 1.683692455291748
{
"embeddings": [
{
"tensorName": "The soft VSM with regularized word embeddings on the BBCSPORT dataset",
"tensorShape": [
737,
3
],
"tensorPath": "https://gist.githubusercontent.com/Witiko/c2fff1238207bb1e3ad7e48865d8ec3a/raw/bf3b0a954189f556dac07476650ddd6ef1d01d89/docsim-sparse_scm-bbcsport-1-True-True-True-100--1.0-2-vectors.csv",
"metadataPath": "https://gist.githubusercontent.com/Witiko/faed3553a6eeed93edc9f592924620b7/raw/c21c12dc88807d32e5a79dae3644ca7f0c429b77/docsim-dense_scm-bbcsport-1-True-True-True-100--1.0-2-metadata.csv"
We can make this file beautiful and searchable if this error is corrected: No commas found in this CSV file in line 0.
comp.sys.mac.hardware
talk.politics.guns
talk.religion.misc
rec.sport.baseball
comp.sys.ibm.pc.hardware
comp.os.ms-windows.misc
talk.politics.misc
talk.politics.mideast
comp.sys.mac.hardware
sci.electronics
We can't make this file beautiful and searchable because it's too large.
25.758832931518555 -12.827503204345703 2.0738046169281006
-13.7992525100708 4.20018196105957 17.279329299926758
-10.49587631225586 1.2940646409988403 -12.707731246948242
3.8611440658569336 -15.003565788269043 -16.053197860717773
-8.76230525970459 -4.271994113922119 9.464016914367676
13.71536922454834 -0.8401200771331787 18.73625946044922
-10.988614082336426 -1.2058801651000977 1.7590161561965942
-18.275808334350586 -19.137815475463867 -2.1024789810180664
7.625443458557129 -10.325056076049805 3.7469522953033447
18.870302200317383 9.250248908996582 5.36542272567749
We can make this file beautiful and searchable if this error is corrected: No commas found in this CSV file in line 0.
Electronics
Electronics
CDs and Vinyl
Books
Books
Books
CDs and Vinyl
Home and Kitchen
Electronics
Electronics
We can make this file beautiful and searchable if this error is corrected: No commas found in this CSV file in line 0.
14.763727188110352 -6.425334453582764 14.79475212097168
2.9103310108184814 4.284109115600586 11.086734771728516
7.588127136230469 29.49435806274414 -4.02618408203125
-4.813774108886719 -2.114610433578491 -6.975024700164795
-13.212491989135742 3.5963125228881836 10.37350082397461
-8.92398738861084 3.373044013977051 8.830453872680664
-11.391376495361328 1.298736572265625 -16.327795028686523
7.700194358825684 -12.252012252807617 -3.431995153427124
19.452924728393555 9.807990074157715 6.623404026031494
19.44965362548828 -9.774394035339355 10.746170997619629
{
"embeddings": [
{
"tensorName": "The soft VSM with regularized word embeddings on the AMAZON dataset",
"tensorShape": [
8000,
3
],
"tensorPath": "https://gist.githubusercontent.com/Witiko/2d63aa781e24d6f7c50278f6b0480c8e/raw/6fd7c0fa2f5726223b1727165d14c7118f2cfcd5/docsim-sparse_scm-amazon-1-False-True-True-100--1.0-1-vectors.csv",
"metadataPath": "https://gist.githubusercontent.com/Witiko/9343dec33ea3c3f6dac74b1a973b2131/raw/eb93f6b315139a4e17a67b6fbbf01b3e302f9ead/docsim-dense_scm-amazon-1-False-True-True-100--1.0-1-metadata.csv"
{
"embeddings": [
{
"tensorName": "The soft VSM with non-regularized word embeddings on the AMAZON dataset",
"tensorShape": [
8000,
3
],
"tensorPath": "https://gist.githubusercontent.com/Witiko/9343dec33ea3c3f6dac74b1a973b2131/raw/eb93f6b315139a4e17a67b6fbbf01b3e302f9ead/docsim-dense_scm-amazon-1-False-True-True-100--1.0-1-vectors.csv",
"metadataPath": "https://gist.githubusercontent.com/Witiko/9343dec33ea3c3f6dac74b1a973b2131/raw/eb93f6b315139a4e17a67b6fbbf01b3e302f9ead/docsim-dense_scm-amazon-1-False-True-True-100--1.0-1-metadata.csv"
{
"embeddings": [
{
"tensorName": "The soft VSM with non-regularized word embeddings on the 20NEWS dataset",
"tensorShape": [
9152,
3
],
"tensorPath": "https://gist.githubusercontent.com/Witiko/3143cead53253d8376a7fbbc45a61f6c/raw/f1bc30840178e8fed8550c0fb7f59137958ca6c3/docsim-dense_scm-20news-1-False-True-True-400--1.0-2-vectors.csv",
"metadataPath": "https://gist.githubusercontent.com/Witiko/3143cead53253d8376a7fbbc45a61f6c/raw/f1bc30840178e8fed8550c0fb7f59137958ca6c3/docsim-dense_scm-20news-1-False-True-True-400--1.0-2-metadata.csv"