rubyopen-uri

Is there a workaround to open URLs containing underscores in Ruby?


I'm using open-uri to open URLs.

resp = open("http://sub_domain.domain.com")

If it contains underscore I get an error:

URI::InvalidURIError: the scheme http does not accept registry part: sub_domain.domain.com (or bad hostname?)

I understand that this is because according to RFC URLs can contain only letters and numbers. Is there any workaround?


Solution

  • This looks like a bug in URI, and uri-open, HTTParty and many other gems make use of URI.parse.

    Here's a workaround:

    require 'net/http'
    require 'open-uri'
    
    def hopen(url)
      begin
        open(url)
      rescue URI::InvalidURIError
        host = url.match(".+\:\/\/([^\/]+)")[1]
        path = url.partition(host)[2] || "/"
        Net::HTTP.get host, path
      end
    end
    
    resp = hopen("http://dear_raed.blogspot.com/2009_01_01_archive.html")