Skip to content

Instantly share code, notes, and snippets.

@zippera
zippera / diandian.py
Last active December 20, 2015 06:29
python爬虫,下载点点「美女」标签下的大图。 更新:缩短文件名;排除杂图;改善提示信息 使用方法:新建一个文件夹,把代码保存为name.py文件,运行 「python name.py」就可以把图片下载到文件夹。 计划:使用压缩;使用多线程;创建相对路径文件夹。
# -*- coding: utf-8 -*-
#---------------------------------------
# 程序:点点美女图片爬虫
# 版本:0.2
# 作者:zippera
# 日期:2013-07-26
# 语言:Python 2.7
# 说明:能设置下载的页数
@zippera
zippera / haotu.py
Last active December 20, 2015 07:39
好图爬虫
from urllib2 import urlopen, URLError
from urllib import urlretrieve
import threading
import re
import os
downroot = r'D:\luguan'
p1 = re.compile(r'src="img.+jpg')
p2 = re.compile(r'\d{1,}\.jpg')
prefix = 'http://www.lu-guan.com/xxoo/'
# coding: utf-8
import urllib2
import urllib
import re
pat = re.compile(r'<div.*?class="content".*?ti.*?>(.*?)</div>', re.S)#有问题
nexturl1 = "http://www.qiushibaike.com/month/page/"
Error in user YAML: (<unknown>): found character that cannot start any token while scanning for the next token at line 2 column 1
---
# Python 简介
@su27 and @menghan
---

What is Python?

Python: 优雅而健壮的编程语言

  • 高级

  • 易学易读易维护

@zippera
zippera / mlook.py
Last active December 21, 2015 05:49
#encoding=utf-8
import requests as rq
from bs4 import BeautifulSoup as bs
lgurl = 'http://mlook.mobi/member/login'
host = 'http://mlook.mobi'
tmpurl = 'http://mlook.mobi/book/info/6248'
hds = { 'User-Agent' : 'Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/28.0.1500.72 Safari/537.36',
'Referer':'http://mlook.mobi/member/login',
@zippera
zippera / mlook_ebooks.py
Last active October 17, 2017 07:18
依赖第三方库:requests和BeautifulSoup4 只下载mobi和epub格式电子书
#coding=utf-8
##版本:1.0
##环境:python2.7
##作者:moxie
##日期:2013.08.18
##第三方依赖:requests和BeautifulSoup4
import re
import requests as rq
@zippera
zippera / v2ex_topx.py
Last active December 21, 2015 08:18
这个代码可以抓取v2ex回复数量前N的帖子。统计的页数和Top页数都可以指定。
#coding=utf8
##版本:1.0
##环境:python2.7
##作者:moxie
##日期:2013.08.20
import urllib2
import re
import threading
#coding=utf8
##版本:1.0
##环境:python2.7
##作者:moxie
##日期:2013.08.23
##说明:文件生成目录为:D:/ludatui
import urllib2,urllib
import re
#Newbie programmer
def factorial(x):
if x == 0:
return 1
else:
return x * factorial(x - 1)
print factorial(6)
#First year programmer, studied Pascal
@zippera
zippera / bow.py
Created December 16, 2013 04:22
人工智能作业,对图片分类。
#!/usr/bin/env python
#coding=utf-8
import os
from numpy import array
from scipy.cluster.vq import vq, kmeans, whiten, kmeans2
from sklearn import svm
from sklearn import cross_validation
ncluster = 6 #聚类数
########