Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import unicodedata | |
def normalize_string(string): | |
if isinstance(string, str): | |
nfkd_form = unicodedata.normalize('NFKD', string.lower()) | |
return nfkd_form.encode('ASCII', 'ignore').decode('utf-8') |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Candidatos | |
http://agencia.tse.jus.br/estatistica/sead/odsele/consulta_cand/consulta_cand_2016.zip | |
http://agencia.tse.jus.br/estatistica/sead/odsele/bem_candidato/bem_candidato_2016.zip | |
http://agencia.tse.jus.br/estatistica/sead/odsele/consulta_legendas/consulta_legendas_2016.zip | |
http://agencia.tse.jus.br/estatistica/sead/odsele/consulta_vagas/consulta_vagas_2016.zip | |
http://agencia.tse.jus.br/estatistica/sead/odsele/motivo_cassacao/motivo_cassacao_2016.zip | |
http://agencia.tse.jus.br/estatistica/sead/odsele/consulta_cand/consulta_cand_2014.zip | |
http://agencia.tse.jus.br/estatistica/sead/odsele/bem_candidato/bem_candidato_2014.zip |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
id | city | state_id | state_name | state | |
---|---|---|---|---|---|
1100015 | Alta Floresta D'Oeste | 11 | Rondônia | RO | |
1100023 | Ariquemes | 11 | Rondônia | RO | |
1100031 | Cabixi | 11 | Rondônia | RO | |
1100049 | Cacoal | 11 | Rondônia | RO | |
1100056 | Cerejeiras | 11 | Rondônia | RO | |
1100064 | Colorado do Oeste | 11 | Rondônia | RO | |
1100072 | Corumbiara | 11 | Rondônia | RO | |
1100080 | Costa Marques | 11 | Rondônia | RO | |
1100098 | Espigão D'Oeste | 11 | Rondônia | RO |
We can't make this file beautiful and searchable because it's too large.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
achievement,achievement_code,additional_value,author,author_page_summary,category,department,destination,file_generation_date,file_generation_time,intervention,intervention_code,justification,location,number,proposed_wording,reference,total_page_summary,type,commitment_info_url,url | |
Implantação/Aparelham/Adequação Unid Saúde/ Aquis Unid Móvel,552,5.343.000,3230 - Jaime Martins,1 de 13,Individual,Saúde,ESPELHO DE EMENDA DE APROPRIAÇÃO DE DESPESA,2013-12-02,22:03,Atenção Especializada:Hospitais/Policlínicas/Unid.Especializ,003,,3100000 - Minas Gerais,32300001,,,3497 de 8807,Apropriação - Inclusão,http://inteligenciadenegocios3.camara.gov.br/painel/redirectorcamento.jsp?urlbo=iDocID=79334%26sOutputFormat=P%26sRefresh=Y%26lsSANO=2009%26lsSMES=12%26lsSORGAO=%26lsSUO=55901%26lsSACAO=2B31%26lsSSUBTITULO=0031,http://www.camara.gov.br/internet/comissao/index/mista/orca/orcamento/or2009/emendas/despesa/DANIELRJ_AV_LOA_AUTOR2_3230.pdf | |
Implantação/Aparelham/Adequação Unid Saúde/ Aquis Unid Móvel,552,2.000.000,3230 - Jaime |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
category | value | year | |
---|---|---|---|
Encargos especiais | 2100000000000 | 2017 | |
Previdência social | 650500000000 | 2017 | |
Educação | 110700000000 | 2017 | |
Saúde | 107300000000 | 2017 | |
Assistência social | 83200000000 | 2017 | |
Trabalho | 74800000000 | 2017 | |
Defesa nacional | 68400000000 | 2017 | |
Reserva de contingência | 60200000000 | 2017 | |
Agricultura | 35800000000 | 2017 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
$ pgloader --debug --verbose mysql://root:root-password@localhost/meumunicipio postgresql:///meumunicipio:meumunicipio-password@localhost/ | |
sb-impl::*default-external-format* :UTF-8 | |
tmpdir: #P"/var/folders/y_/mkfhjlmd7yjfmsj1m5l8r4m80000gn/T/" | |
2017-07-18T12:10:43.015000+01:00 NOTICE Starting pgloader, log system is ready. | |
2017-07-18T12:10:43.023000+01:00 INFO Starting monitor | |
2017-07-18T12:10:43.025000+01:00 LOG Main logs in '/private/tmp/pgloader/pgloader.log' | |
2017-07-18T12:10:43.025000+01:00 LOG Data errors in '/private/tmp/pgloader/' | |
2017-07-18T12:10:43.025000+01:00 INFO SOURCE: "mysql://root:root-password@localhost/meumunicipio" | |
2017-07-18T12:10:43.025000+01:00 INFO TARGET: "postgresql:///meumunicipio:meumunicipio-password@localhost/" | |
2017-07-18T12:10:43.025000+01:00 INFO Stopping monitor |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
We don’t store your data, period. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
from scrapy import project, signals | |
from scrapy.conf import settings | |
from scrapy.crawler import CrawlerProcess | |
from scrapy.xlib.pydispatch import dispatcher | |
from multiprocessing.queues import Queue | |
import multiprocessing | |
class CrawlerWorker(multiprocessing.Process): | |
def __init__(self, spider, result_queue): |