High-priority blogs

https://freshrimpsushi.github.io/ # mathematics and mathematical statistics in general
https://darkpgmr.tistory.com/103?category=460967 # linear algebra
http://wolfpack.hannam.ac.kr/ # outstanding, highly recommended

Recommended books

https://python-guide-kr.readthedocs.io/ko/latest/

VIF

Model visualization

Evaluation Metrics

Understand the weaknesses of each evaluation metric:
- MSE
- RMSE
- MAE
- R-squared
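
A minimal sketch of the four metrics listed above, using only the standard library. The function name `regression_metrics` and the sample values are made up for illustration; the one large miss in the data is there to show how squared-error metrics react to an outlier more strongly than MAE does.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE and R-squared for two equal-length lists."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    sse = sum(e ** 2 for e in errors)            # sum of squared errors
    mse = sse / n
    rmse = math.sqrt(mse)                        # same unit as the target
    mae = sum(abs(e) for e in errors) / n        # less sensitive to outliers
    mean_y = sum(y_true) / n
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    r2 = 1 - sse / ss_tot                        # can be negative for bad fits
    return {"mse": mse, "rmse": rmse, "mae": mae, "r2": r2}

# Hypothetical data: three small errors and one large miss (9 vs 13)
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.0, 5.0, 8.0, 13.0]
metrics = regression_metrics(y_true, y_pred)
print(metrics)
```

Here the single error of 4 contributes 16 of the 18 units of squared error, so MSE (4.5) is three times MAE (1.5).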

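Bias-Variance Tradeoff

The right balance between bias and variance depends on the nature of the problem being solved.
The higher the model complexity, the more easily overfitting occurs.
A model that generalizes well is a model that is less overfit.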
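
One way to make the tradeoff concrete is the standard decomposition of expected squared error. The notation is assumed here and does not come from the notes: data follow $y = f(x) + \varepsilon$ with $\mathbb{E}[\varepsilon] = 0$ and $\mathrm{Var}(\varepsilon) = \sigma^2$, and $\hat{f}$ is the model fitted on a random training set.

$$
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
= \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
+ \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{variance}}
+ \underbrace{\sigma^2}_{\text{irreducible noise}}
$$

Simple models tend to have a large bias term, and complex models tend to have a large variance term.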
https://datacookbook.kr/48
https://modulabs-biomedical.github.io/Bias_vs_Variance
https://bywords.tistory.com/entry/%EB%B2%88%EC%97%AD-%EC%9C%A0%EC%B9%98%EC%9B%90%EC%83%9D%EB%8F%84-%EC%9D%B4%ED%95%B4%ED%95%A0-%EC%88%98-%EC%9E%88%EB%8A%94-biasvariance-tradeoff
https://rfriend.tistory.com/189

Polynomial regression

Raising the degree increases the model's complexity.
As complexity keeps rising, overfitting sets in at some point.
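
A rough, standard-library-only sketch of this effect. The cubic target function, noise level, sample sizes and the helper names (`fit_poly`, `predict`, `mse`) are all assumptions chosen for illustration. Training error can only fall as the degree grows, while error on held-out points typically stops improving and then gets worse.

```python
import random

def fit_poly(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations (A^T A) c = A^T y."""
    n = degree + 1
    ata = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    aty = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(ata[r][col]))
        ata[col], ata[pivot] = ata[pivot], ata[col]
        aty[col], aty[pivot] = aty[pivot], aty[col]
        for row in range(col + 1, n):
            factor = ata[row][col] / ata[col][col]
            for k in range(col, n):
                ata[row][k] -= factor * ata[col][k]
            aty[row] -= factor * aty[col]
    # Back substitution
    coefs = [0.0] * n
    for i in reversed(range(n)):
        s = sum(ata[i][j] * coefs[j] for j in range(i + 1, n))
        coefs[i] = (aty[i] - s) / ata[i][i]
    return coefs

def predict(coefs, x):
    return sum(c * x ** i for i, c in enumerate(coefs))

def mse(coefs, xs, ys):
    return sum((y - predict(coefs, x)) ** 2 for x, y in zip(xs, ys)) / len(xs)

def true_f(x):
    return x ** 3 - x  # hypothetical ground truth

random.seed(0)
train_x = [-1 + 2 * i / 11 for i in range(12)]
train_y = [true_f(x) + random.gauss(0, 0.1) for x in train_x]
test_x = [-0.95 + 1.9 * i / 49 for i in range(50)]
test_y = [true_f(x) + random.gauss(0, 0.1) for x in test_x]

results = {}
for degree in (1, 3, 9):
    coefs = fit_poly(train_x, train_y, degree)
    results[degree] = (mse(coefs, train_x, train_y), mse(coefs, test_x, test_y))
    print(f"degree={degree}: train MSE={results[degree][0]:.4f}, "
          f"test MSE={results[degree][1]:.4f}")
```

Comparing the two columns of the printout across degrees is the same check a validation set provides in practice.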

Dictionary get() method
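
A short illustration of `dict.get`, which returns a default instead of raising `KeyError` when a key is missing. The dictionaries and keys below are invented for the example.

```python
stock = {"apple": 3}

print(stock.get("apple"))      # existing key: returns its value
print(stock.get("pear"))       # missing key: returns None instead of raising
print(stock.get("pear", 0))    # missing key with an explicit default

# Common idiom: counting without checking for the key first
word_counts = {}
for word in ["a", "b", "a"]:
    word_counts[word] = word_counts.get(word, 0) + 1
print(word_counts)
```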

Gauss-Markov Theorem: conditions on the error term
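
For reference, the usual textbook statement of those conditions, written for the linear model $y = X\beta + \varepsilon$ (the notation is assumed here and is not from the notes):

$$
\mathbb{E}[\varepsilon_i \mid X] = 0, \qquad
\mathrm{Var}(\varepsilon_i \mid X) = \sigma^2, \qquad
\mathrm{Cov}(\varepsilon_i, \varepsilon_j \mid X) = 0 \quad (i \neq j)
$$

Together with a full-column-rank $X$, these imply that the OLS estimator $\hat{\beta} = (X^\top X)^{-1} X^\top y$ is the best linear unbiased estimator (BLUE). Normality of the errors is not required.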

Optimization

Classification

https://medium.com/quick-code/regression-versus-classification-machine-learning-whats-the-difference-345c56dd15f7

Must do today!! -> getting the name of a variable as a string

int days to year

Logistic Regression

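Very, very important
Classification
When the class proportions are heavily imbalanced, accuracy is a misleading metric.
It is a classifier, but its output is a probability -> it can be interpreted like a regression model.
Fundamentally, it is important to view logistic regression as a special case of the GLM.
Interpretation: each one-unit increase in a feature multiplies the odds by exp(coefficient), i.e. the odds change by ~%. The change in the probability itself is not constant.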
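
A small sketch of how a logistic regression coefficient maps to odds and to probability. The intercept and coefficient are hypothetical values, not fitted from any data. The odds ratio for a one-unit increase is the same at every starting point, while the jump in probability depends on where you start.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

intercept, coef = -1.0, 0.5      # hypothetical fitted parameters
odds_ratio = math.exp(coef)      # multiplicative change in odds per unit of x

rows = []
for x in (0, 2, 6):
    p_before = sigmoid(intercept + coef * x)
    p_after = sigmoid(intercept + coef * (x + 1))
    ratio = (p_after / (1 - p_after)) / (p_before / (1 - p_before))
    rows.append((x, p_before, p_after, ratio))
    print(f"x={x}: p {p_before:.3f} -> {p_after:.3f}, odds ratio {ratio:.3f}")

print(f"exp(coef) = {odds_ratio:.3f}")
```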
- https://yngie-c.github.io/machine%20learning/2020/04/19/Logistic_Regression/
- https://ratsgo.github.io/machine%20learning/2017/07/02/logistic/ # 로지스틱 파라미터 추정
- https://ratsgo.github.io/machine%20learning/2017/04/02/logistic/ # 로지스틱 회귀
- https://seamless.tistory.com/23
math
- https://towardsai.net/p/machine-learning/logistic-regression-with-mathematics
https://medium.com/analytics-vidhya/logistic-regression-b35d2801a29c

Moments

Search Google for 'moment generating function normal distribution' ('적률생성함수 정규분포')
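
For quick reference, the standard result that search leads to, for $X \sim N(\mu, \sigma^2)$:

$$
M_X(t) = \mathbb{E}\big[e^{tX}\big] = \exp\!\Big(\mu t + \tfrac{1}{2}\sigma^2 t^2\Big),
\qquad
M_X'(0) = \mu, \qquad
M_X''(0) = \mu^2 + \sigma^2
$$

The $k$-th derivative at $t = 0$ gives the $k$-th moment $\mathbb{E}[X^k]$, so the variance follows as $M_X''(0) - M_X'(0)^2 = \sigma^2$.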
https://freshrimpsushi.github.io/posts/expectation-mean-variance-moment/
http://blog.naver.com/PostView.nhn?blogId=mykepzzang&logNo=220846464280
https://hsm-edu.tistory.com/1198
https://hsm-edu.tistory.com/756
https://freshrimpsushi.github.io/categories/%EC%88%98%EB%A6%AC%ED%86%B5%EA%B3%84%ED%95%99/

feature selection

https://scikit-learn.org/stable/modules/feature_selection.html

Troubleshooting

ydataai/ydata-profiling#233 # when pandas profiling does not work: uninstall and reinstall

GLM

Training, Validation, Testing

Gradient descent

https://angeloyeo.github.io/2020/08/16/gradient_descent.html
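
A bare-bones sketch of the update rule x <- x - lr * f'(x) on a one-dimensional quadratic. The objective, learning rate and step count are arbitrary choices for illustration.

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x

# Hypothetical objective f(x) = (x - 3)^2 with gradient f'(x) = 2 * (x - 3)
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(x_min)  # approaches the minimizer x = 3
```

With this objective each step shrinks the distance to the minimum by a factor of 0.8. A learning rate above 1.0 would make the iterates diverge instead.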

ADAM

https://www.youtube.com/watch?v=JXQT_vxqwIs

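Ridge

As lambda increases, the coefficients are shrunk more strongly, the score gradually drops, and only the top features keep sizable weights.
Ridge regression therefore behaves loosely like feature selection, but it does not set coefficients exactly to zero; Lasso (L1 penalty) is the variant that actually drops features.
For polynomial features, the score rises as lambda increases, up to some point.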
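
A hand-rolled sketch of the ridge closed form, beta = (X^T X + lambda * I)^(-1) X^T y, for two features and no intercept. The helper `ridge_two_features` and the toy data are assumptions for illustration only and are not scikit-learn's implementation. It shows the coefficient vector shrinking as lambda grows while both entries stay nonzero.

```python
import math

def ridge_two_features(X, y, lam):
    """Closed-form ridge solution for exactly two features, no intercept."""
    a = sum(r[0] * r[0] for r in X) + lam
    b = sum(r[0] * r[1] for r in X)
    d = sum(r[1] * r[1] for r in X) + lam
    u = sum(r[0] * t for r, t in zip(X, y))
    v = sum(r[1] * t for r, t in zip(X, y))
    det = a * d - b * b                      # determinant of the 2x2 system
    return ((d * u - b * v) / det, (a * v - b * u) / det)

# Toy data generated exactly from y = 3 * x1 + 0.5 * x2
X = [[1, 0], [0, 1], [1, 1], [2, 1], [1, 2]]
y = [3 * x1 + 0.5 * x2 for x1, x2 in X]

norms = []
for lam in (0, 1, 10, 100):
    beta = ridge_two_features(X, y, lam)
    norms.append(math.hypot(*beta))
    print(f"lambda={lam}: beta=({beta[0]:.3f}, {beta[1]:.3f}), norm={norms[-1]:.3f}")
```

The norm of the coefficient vector falls steadily, but individual coefficients need not move monotonically and none of them reaches exactly zero.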
https://datascienceschool.net/03%20machine%20learning/06.05%20%EC%A0%95%EA%B7%9C%ED%99%94%20%EC%84%A0%ED%98%95%ED%9A%8C%EA%B7%80.html
https://scikit-learn.org/stable/modules/linear_model.html # official docs; the theory here is very important
https://riverzayden.tistory.com/15 # important theory
https://blog.naver.com/wjddudwo209/220177096998
https://student9725.tistory.com/31
https://link.springer.com/referenceworkentry/10.1007%2F978-0-387-73003-5_1070
https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.RidgeCV.html
https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.Ridge.html # distinguish RidgeCV from Ridge

mysql for jupyterlab

https://docs.kyso.io/guides/sql-interface-within-jupyterlab

Coding Convention

Split-Data

https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html?highlight=train%20test%20split#sklearn-model-selection-train-test-split

Encoding Style

fit_transform vs transform

https://deepinsight.tistory.com/165
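
A toy illustration of the distinction. `TinyStandardScaler` is a hypothetical stand-in written for this note, not scikit-learn's `StandardScaler`; it only mirrors the `fit` / `transform` / `fit_transform` naming. The point is that statistics are learned from the training data once and then reused unchanged on the test data.

```python
import statistics

class TinyStandardScaler:
    """Hypothetical minimal scaler for a single numeric column."""

    def fit(self, values):
        # Learn the statistics from the data passed in
        self.mean_ = statistics.fmean(values)
        self.std_ = statistics.pstdev(values)   # population standard deviation
        return self

    def transform(self, values):
        # Apply the already learned statistics; nothing is re-estimated
        return [(v - self.mean_) / self.std_ for v in values]

    def fit_transform(self, values):
        return self.fit(values).transform(values)

train = [1.0, 2.0, 3.0, 4.0, 5.0]
test = [3.0, 6.0]

scaler = TinyStandardScaler()
train_scaled = scaler.fit_transform(train)   # learn mean/std from train, then scale
test_scaled = scaler.transform(test)         # reuse the train mean/std on test
print(train_scaled)
print(test_scaled)
```

Calling `fit_transform` on the test set would re-estimate the mean and standard deviation from test data, which leaks test information and puts the two sets on different scales.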

Select K best

Unsorted

list comprehension

MSE

basic terms

Attribute and Parameter
- https://www.differencebetween.com/difference-between-attribute-and-vs-parameter/

F-string

https://www.datacamp.com/community/tutorials/f-string-formatting-in-python
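
A few f-string format specifiers that come up often in analysis code. The variable names and values are made up for the example.

```python
name = "ridge"
score = 0.87654
n_rows = 1234567

summary = f"{name} scored {score:.2f}"    # two decimal places
as_percent = f"{score:.1%}"               # percentage with one decimal
with_commas = f"{n_rows:,}"               # thousands separator
padded = f"{name:>10}|"                   # right-align in a field of width 10

print(summary)
print(as_percent)
print(with_commas)
print(padded)
```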

#### Regression

Vector spaces

https://ratsgo.github.io/linear%20algebra/2017/05/20/spaces/

python

python-tricks

Number display format (print)

https://www.geeksforgeeks.org/display-scientific-notation-as-float-in-python/
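
A quick sketch of switching between scientific and fixed-point display with format specifiers. The value is arbitrary.

```python
x = 1.5e-5

default_text = str(x)        # Python picks scientific notation for small floats
fixed_text = f"{x:.7f}"      # force fixed-point with 7 decimal places
sci_text = f"{x:.2e}"        # force scientific notation with 2 decimals

print(default_text)
print(fixed_text)
print(sci_text)
```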

pandas

https://nicola-ml.tistory.com/23 # basics
https://stackoverflow.com/questions/28669482/appending-pandas-dataframes-generated-in-a-for-loop # accumulating DataFrames in a for loop
https://stackoverflow.com/questions/20461165/how-to-convert-index-of-a-pandas-dataframe-into-a-column # converting a Series to a DataFrame, handling the index

numpy

https://stackoverflow.com/questions/12235552/r-function-rep-in-python-replicates-elements-of-a-list-vector # doing R's rep() in Python

Visualization

https://www.python-graph-gallery.com/

Styling

http://seaborn.pydata.org/tutorial/aesthetics.html

annotating plots

Scientific notation

Correlation matrix

Making a PerformanceAnalytics-style correlation matrix

Scatter plot

general

Viewing a method's source

https://bryan7.tistory.com/826 # viewing method source with inspect
https://stackoverflow.com/questions/427453/how-can-i-get-the-source-code-of-a-python-function # more details
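
A minimal example of the `inspect` approach, using `json.dumps` from the standard library as an arbitrary target. It assumes the module's `.py` source is available on disk, which is the normal case for pure-Python modules.

```python
import inspect
import json

# getsource returns the function's source code as a single string
source = inspect.getsource(json.dumps)
print(source.splitlines()[0])         # the "def dumps(...)" line
print(inspect.getsourcefile(json))    # path of the file that defines the module
```

Functions implemented in C (for example `len`) have no Python source, so `inspect.getsource` raises `TypeError` for them.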

Regular expressions

https://bradbury.tistory.com/47

Jupyter

Converting an ipynb file to PDF

https://stackoverflow.com/questions/15998491/how-to-convert-ipython-notebooks-to-pdf-and-html

markdown

nvim markdown setup

https://jdhao.github.io/2019/01/15/markdown_edit_preview_nvim/

git

https://lasdri.tistory.com/809 # errors with git push over SSH
https://parksb.github.io/article/28.html # troubleshooting

SQL

Preprocessing

Simple preprocessing

ML

Time series forecasting

https://otexts.com/fppkr/

Linear regression

Linear regression from a geometric perspective

Why use RMSE

https://data101.oopy.io/mae-vs-rmse

Understanding baselines

https://blog.ml.cmu.edu/2020/08/31/3-baselines/

Unsupervised Learning

PCA

Clustering

t-SNE

Math

Linear algebra

https://darkpgmr.tistory.com/103 # from the basics to applications
https://cran.r-project.org/web/packages/matlib/vignettes/inv-ex1.html # inverse matrix in R

Matrices and linear transformations

https://ratsgo.github.io/linear%20algebra/2017/05/21/determinants/ # determinants
https://angeloyeo.github.io/2019/07/15/Matrix_as_Linear_Transformation.html # matrices and linear transformations
https://losskatsu.github.io/linear-algebra/linear-trans/#

yjinheon/reference_link.md
