Skip to content

Instantly share code, notes, and snippets.

@eric-czech
eric-czech / clouderprovider.py
Last active November 13, 2020 22:27
Dask Cloud Provider Usage for GCP
# Dask Cloud Provider REPL
#
# This is useful for creating clusters indepedent of the code that runs on them
# Example: python scripts/cloudprovider.py -- --interactive
#
from dask_cloudprovider.gcp.instances import GCPCluster
import fire
import os
import json
@eric-czech
eric-czech / pairwise_numba_options.ipynb
Last active October 6, 2020 13:46
pairwise_numba_options
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / pca_loadings_vs_scores.ipynb
Created September 12, 2020 16:22
PCA loadings vs scores
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / dask_svd_compressed_benchmark.ipynb
Created September 11, 2020 16:47
Dask SVD compressed array shape benchmark
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / dask_svd_benchmark.ipynb
Created September 9, 2020 17:09
Dask svd transpose and sign flip benchmark
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / pca_comparison.ipynb
Last active August 27, 2020 17:54
PCA Comparison (Scikit-allel vs Scikit-allel vs Dask-ML)
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / sgkit_is_segregating.ipynb
Last active August 10, 2020 21:26
Segregating variants w/ Xarray
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / dividx.py
Created July 23, 2020 19:23
Function to partition n elements into g groups (like np.array_split without materialization)
def dividx(n, groups):
"""Create index for groups that partition an array
The number of elements placed into a group will
either equal `n//groups` or `n//groups + 1`, depending
on how many of the latter are necessary to make
the partitioning complete.
Parameters
----------
@eric-czech
eric-czech / allele_count_graph.ipynb
Created July 20, 2020 10:28
Dask tasks for counting alleles
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@eric-czech
eric-czech / dask_genetic_data_simulation.ipynb
Last active May 23, 2020 16:18
Genetic data simulations with dask
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.