Last active
March 27, 2022 09:39
-
-
Save stammy/790971 to your computer and use it in GitHub Desktop.
Import a WordPress database and generate markdown files for Jekyll
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# View my Jekyll blog http://paulstamatiou.com and my jekyll migration post http://paulstamatiou.com/how-to-wordpress-to-jekyll/ | |
# | |
# | |
# based on the import script by icebreaker, which is based on mojombo's | |
# https://github.com/mojombo/jekyll/blob/master/lib/jekyll/migrators/wordpress.rb | |
# https://gist.github.com/303570 | |
# edited to rewrite image URLs to use my CloudFront URL | |
require 'rubygems' | |
require 'sequel' | |
require 'fileutils' | |
# $ export DB=my_wpdb | |
# $ export USER=dbuser | |
# $ export PASS=dbpass | |
# $ ruby -r './lib/jekyll/migrators/wordpress' -e 'Jekyll::WordPress.process( "#{ENV["DB"]}", "#{ENV["USER"]}", "#{ENV["PASS"]}")' | |
# NOTE: This converter requires Sequel and the MySQL gems. | |
# The MySQL gem can be difficult to install on OS X. Once you have MySQL | |
# installed, running the following commands should work: | |
# $ sudo gem install sequel | |
# $ sudo gem install mysql -- --with-mysql-config=/usr/local/mysql/bin/mysql_config | |
module Jekyll | |
module WordPress | |
# Reads a MySQL database via Sequel and creates a post file for each | |
# post in wp_posts that has post_status = 'publish'. | |
# This restriction is made because 'draft' posts are not guaranteed to | |
# have valid dates. | |
QUERY = "select post_title, post_name, post_date, post_content, post_excerpt, ID, guid from wp_posts where post_status = 'publish' and post_type = 'post'" | |
# Fetch all tags for a given POST ID | |
TAGS_QUERY = "select tm.term_id,tm.name from wp_term_relationships tr | |
inner join wp_term_taxonomy tt on tr.term_taxonomy_id = tt.term_taxonomy_id | |
inner join wp_terms tm on tm.term_id=tt.term_id | |
where tr.object_id=%d and tt.taxonomy = 'post_tag'"; | |
def self.process(dbname = '', user='', pass='', host = 'localhost', domain = '') | |
db = Sequel.mysql(dbname, :user => user, :password => pass, :host => host) | |
FileUtils.mkdir_p "_posts" | |
db[QUERY].each do |post| | |
# Get required fields and construct Jekyll compatible name | |
title = post[:post_title] | |
slug = post[:post_name] | |
date = post[:post_date] | |
content = post[:post_content] | |
name = "%02d-%02d-%02d-%s.markdown" % [date.year, date.month, date.day, slug] | |
# Get associated taxonomy terms (tags) | |
# We replace + with nothing and transform to lower case | |
# TODO: figure out what other characters would fuck up YAML | |
tags = [] | |
db[TAGS_QUERY % post[:ID]].each do |tag| | |
tags << tag[:name].to_s.gsub('+','').downcase | |
end | |
# Process content to rewrite some URLs | |
if domain | |
content = self.transformUrls(domain,content) | |
end | |
# Get the relevant fields as a hash, delete empty fields and convert | |
# to YAML for the header | |
data = { | |
'layout' => 'post', | |
'title' => title.to_s, | |
'excerpt' => post[:post_excerpt].to_s, | |
'tags' => tags | |
}.delete_if { |k,v| v.nil? || v == ''}.to_yaml | |
# Write out the data and content to file | |
File.open("_posts/#{name}", "w") do |f| | |
f.puts data | |
f.puts "---" | |
f.puts content | |
end | |
end | |
end | |
# Process the content and replace URLs pointing to wp-content/uploads/ | |
# with CloudFront CNAME'd URL turbo.paulstamatiou.com/uploads/ | |
def self.transformUrls(domain,content) | |
baseurl = "%s/wp-content/uploads/" % domain | |
return content.gsub(baseurl,"turbo.paulstamatiou.com/uploads/") | |
end | |
end | |
end |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
@stammy:
With respect to the lines 18–22 of this Gist as quoted here:
the instructions continued within it no longer seem to work. Specifically, the last line, number 22, does not seem to lend itself well to successful invocation. With Ruby v2.4.3 and MySQL v5.7.20 (both installed by Homebrew,) attempting to execute said command — or, rather, a variation thereupon in the form of
sudo gem install --no-user-install mysql -- --with-mysql-config="/usr/local/bin/mysql_config"
— results in the following tirade of warnings and errors:The same is also true with respect to the equivalent lines of @harperreed's migration script, which you reference in the blog post with which this Gist goes. I suspect that this is due to changes in Ruby and MySQL that have not been taken into account within the Ruby
mysql
Gem's codebase, as its author stopped maintaining it a couple years ago.