Skip to content

Instantly share code, notes, and snippets.

@sasamijp
sasamijp / q42
Last active August 29, 2015 14:00
死ね
gets
member = {
'J' => 1,
'O' => 2,
'I' => 4
}
def dumpArray(array)
array.each do |value|
@sasamijp
sasamijp / study.rb
Created August 6, 2014 17:12
http://ssimas.blog.fc2.com/ の スレッドの本文を抽出する
# -*- encoding: utf-8 -*-
require 'nokogiri'
require "open-uri"
require './SSparser.rb'
require './db_manager.rb'
# http://ssimas.blog.fc2.com/
#url = 'http://ssimas.blog.fc2.com/blog-entry-3218.html#more'
@sasamijp
sasamijp / intelligence.rb
Created August 6, 2014 17:21
天海春香12号
# -*- encoding: utf-8 -*-
require './wakati.rb'
require './db_manager.rb'
class Intelligence
def initialize
@wakati = Wakati.new
db = DbManager.new
@responds = db.read("amami.db")
@sasamijp
sasamijp / teach.rb
Created August 8, 2014 09:40
dbからランダムに選んできて手動で5パターンに分類して教師データを作る
# -*- encoding: utf-8 -*-
require 'classifier'
require './db_manager.rb'
require 'natto'
@nm = Natto::MeCab.new
def wakati(str)
array = []
@nm.parse(str) do |n|
@sasamijp
sasamijp / collect_url.rb
Created September 28, 2014 11:32
アーカイブのurl全部取得してdbに書き込む
# -*- encoding: utf-8 -*-
require 'nokogiri'
require "open-uri"
require 'sequel'
def insert(dbname, urls)
db = Sequel.connect("sqlite://#{dbname}")
urls.each do |url|
db[:url].insert(value: url)
@sasamijp
sasamijp / extractcontent.rb
Created September 28, 2014 11:35
dbからurlを読み取ってブログの記事の内容をnokogiriでパースしてから会話コーパスに変換してdbに書き込む
# -*- encoding: utf-8 -*-
require 'nokogiri'
require 'open-uri'
require 'Sequel'
require './SSparser.rb'
# for ankake.blog.jp
s = SSparser.new
@sasamijp
sasamijp / ssparser.rb
Created September 28, 2014 11:40
SSから会話コーパスへの変換をするクラス
# -*- encoding: utf-8 -*-
class SSparser
def parse(body)
(body.count('『') > body.count('「')) ?
body.gsub('「', '「').gsub('『', '「').gsub('』', '」').gsub('」', '」') :
body.gsub('「', '「').gsub('」', '」')
body = body.split("\n").delete_if{|v|v.nil?}
ss = []
@sasamijp
sasamijp / chimpo.rb
Created September 29, 2014 08:26
おちんぽ無限回廊
# encoding: utf-8
array = ["ち","ん","ぽ"]
c = 1
narray = []
1000.times do
narray.push array.sample
end
@sasamijp
sasamijp / ss_analyzer.rb
Created October 25, 2014 08:18
SSparserでパースされたSSがコーパスとして適しているか判定する
# -*- encoding: utf-8 -*-
class SSAnalyzer
def corpus?(ss)
sla = sentence_length_average(ss)
ctc = consecutive_talking_count(ss)
return false if ctc.nil?
ctc = ctc/ss.length.to_f
(ctc >= 0.8) or (ctc >= 0.4 and sla <= 20)
@sasamijp
sasamijp / name.csv
Last active August 19, 2018 14:28
update_nameによって蓄積された名前ログ
sasamijp 2014-03-24 23:08:33 +0900 448099043776794625
おしり sasamijp 2014-03-25 08:58:06 +0900 448247409835913216
おしりターボ firstspring1845 2014-03-25 08:58:31 +0900 448247511027695616
極左 NTTrf 2014-03-25 09:35:42 +0900 448256867165159424
アナル sasamijp 2014-03-25 10:04:17 +0900 448264063898746881
菊地真はホモ 1kaguya 2014-03-25 10:04:44 +0900 448264173814702080
ちんちん sasamijp 2014-03-25 10:11:35 +0900 448265900936798209
sasamijp 2014-03-25 10:40:12 +0900 448273101990555649
アナル sasamijp 2014-03-25 16:28:21 +0900 448360718526447616
ちんちん sasamijp 2014-03-25 17:07:55 +0900 448370672822013952