Last active
August 29, 2015 14:07
-
-
Save monolar/8835598d59ef9d1a2d41 to your computer and use it in GitHub Desktop.
Celluloid-IO TCPSocket memleak/filehandle leak issue
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env ruby | |
require 'rubygems' | |
require 'bundler/setup' | |
require 'celluloid/io' | |
require 'chromatic' | |
require 'objspace' | |
class Client | |
include Celluloid::IO | |
def initialize | |
@socket = nil | |
@connecting_timer = nil | |
reconnect | |
run | |
end | |
def run | |
loop do | |
sleep 1 | |
end | |
end | |
def disconnect | |
return unless @socket | |
@socket.close | |
@socket = nil | |
end | |
def reconnect | |
puts "connecting ...".green | |
if @connecting_timer | |
@connecting_timer.cancel | |
@connecting_timer = nil | |
end | |
disconnect | |
@socket = TCPSocket.new('localhost', 55555) | |
rescue Exception => e | |
puts "error while connecting: #{e.inspect}:\n #{e.backtrace.join("\n ")}".red | |
disconnect | |
dump_objects | |
@connecting_timer = after(1) { | |
reconnect | |
} | |
end | |
def dump_objects | |
GC.start(full_mark: true, immediate_sweep: true) | |
puts ObjectSpace.count_objects[:T_OBJECT] | |
c = ObjectSpace.each_object(Socket) do |s| | |
s.close unless s.closed? | |
end | |
puts "#{c} sockets" | |
end | |
end | |
Client.new |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env ruby | |
require 'rubygems' | |
require 'bundler/setup' | |
require 'celluloid' | |
require 'celluloid/io' | |
class Manager | |
include Celluloid | |
trap_exit :actor_died | |
def initialize | |
connect | |
run | |
end | |
def actor_died(actor, reason) | |
c = ObjectSpace.each_object(Socket) { |s| } | |
puts "#{c} sockets" | |
after (1) { | |
connect | |
} | |
end | |
def connect | |
Connection.new_link.async.connect | |
end | |
def run | |
loop do | |
sleep 1 | |
end | |
end | |
end | |
class Connection | |
include Celluloid::IO | |
def connect | |
@socket = Celluloid::IO::TCPSocket.new('localhost', 5555) | |
# would actually do sensible stuff with the socket now... | |
end | |
end | |
Manager.new |
Dammit - i accidently deleted the first comment in a coffee induced coma:
Here is it again:
Output is something like
connecting ...
error while connecting: #<Errno::ECONNREFUSED: Connection refused - connect(2) for 127.0.0.1:55555>:
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-io-0.16.0/lib/celluloid/io/tcp_socket.rb:85:in `connect_nonblock'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-io-0.16.0/lib/celluloid/io/tcp_socket.rb:85:in `initialize'
bin/demo.rb:44:in `new'
bin/demo.rb:44:in `reconnect'
bin/demo.rb:21:in `initialize'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/calls.rb:26:in `public_send'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/calls.rb:26:in `dispatch'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/calls.rb:63:in `dispatch'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/cell.rb:60:in `block in invoke'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/cell.rb:71:in `block in task'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/actor.rb:357:in `block in task'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/tasks.rb:57:in `block in initialize'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/tasks/task_fiber.rb:15:in `block in create'
411
1 sockets
connecting ...
error while connecting: #<Errno::ECONNREFUSED: Connection refused - connect(2) for 127.0.0.1:55555>:
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-io-0.16.0/lib/celluloid/io/tcp_socket.rb:85:in `connect_nonblock'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-io-0.16.0/lib/celluloid/io/tcp_socket.rb:85:in `initialize'
bin/demo.rb:44:in `new'
bin/demo.rb:44:in `reconnect'
bin/demo.rb:50:in `block in reconnect'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/actor.rb:357:in `block in task'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/tasks.rb:57:in `block in initialize'
/Users/andreas/.rvm/gems/ruby-2.1.3@tcp_socket_demo/gems/celluloid-0.16.0/lib/celluloid/tasks/task_fiber.rb:15:in `block in create'
421
2 sockets
and so on. The left-over filehandles are removed by an ugly hack (s.close unless s.closed?) via ObjectSpace
I have something like:
finalizer :terminate_myactor
def terminate_myactor
begin
@socket.close if [email protected]? && [email protected]? && [email protected]?
rescue Celluloid::Task::TerminatedError
begin
@socket.close if [email protected]? && [email protected]?
rescue Exception
# at this point we can assume things are cleaned up enough
end
end
end
The operating system will eventually release the socket file handles in the process, after they've been in CLOSE_WAIT for awhile.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I added another demo (demo2.rb) which seems to solve this issue. Basically the connection is handled in one actor which is linked to another actor, which takes care of reconnecting.
What is not detailed here is the actual listening and communication between those two actors to actually do something sensible with the socket, e.g. protocol.
This new example produces an output like
This shows that the GC now actually kicks in. While looking at the filehandles it can be nicely seen that they are closed properly on their own...although it can take a while.