Tag: mechanize ruby

如何从Mechanize :: File对象转换为Mechanize :: Page对象?

我有一个登录表单的页面。 登录后有一些重定向。 第一个看起来像这样: #”no-cache=\”set-cookie\””, “content-length”=>”114”, “set-cookie”=>”JSESSIONID=GdJnPVnhtN91KZfQPc3QzM1NLCyWDsnyvpGg8LL0Knnz3RgqxLFs!1803804592!-2134626567; path=/; secure, COOKIE_TEST=Aslyn; secure”, “x-powered-by”=>”Servlet/2.4 JSP/2.0″}, @body=”\r\n\r\n \r\n \r\n \r\n”, @uri=#> 所以当我在这里做一个page.class时,我得到了 Mechanize::File 如何将其转换为Mechanize::Page ? @pguardiario 为了更好地解释我的原始消息中的代码存储在页面中。 当我做page.class时,我得到Mechanize :: File 那么我就执行上面的代码: agent = Mechanize.new agent.post_connect_hooks << lambda {|http| http[:response].content_type = 'text/html'} 所以我这样做:agent.get(page.uri.to_s)或事件尝试使用任何url agent.get(“ https://www.manageyourloans.com/MYL ”)我收到一个错误:ArgumentError:错误的参数数量(4对1) 我甚至试过这个: agent = Mechanize.new { |a| a.post_connect_hooks << lambda { |_,_,response,_| if response.content_type.nil? || […]

得到错误“getaddrinfo:没有这样的主机是已知的。 (Socke tError)“机械化gem

我尝试了下面的代码: require ‘mechanize’ agent = Mechanize.new{|a| a.ssl_version, a.verify_mode = ‘SSLv3′, OpenSSL::SSL::VERIFY_NONE} page = agent.get “https://gegsltraining.aravo.com/” page=page.link_with(:dom_class => “button”).click() 但是我的错误得到了以下错误。 D:\WIPData\Ruby\Scripts>mechanize_dowload.rb C:/Ruby193/lib/ruby/gems/1.9.1/gems/net-http-persistent-2.8/lib/net/http/persist ent/ssl_reuse.rb:29:in `initialize’: getaddrinfo: No such host is known. (Socke tError) from C:/Ruby193/lib/ruby/gems/1.9.1/gems/net-http-persistent-2.8/lib/net /http/persistent/ssl_reuse.rb:29:in `open’ from C:/Ruby193/lib/ruby/gems/1.9.1/gems/net-http-persistent-2.8/lib/net /http/persistent/ssl_reuse.rb:29:in `block in connect’ from C:/Ruby193/lib/ruby/1.9.1/timeout.rb:54:in `timeout’ from C:/Ruby193/lib/ruby/1.9.1/timeout.rb:99:in `timeout’ from C:/Ruby193/lib/ruby/gems/1.9.1/gems/net-http-persistent-2.8/lib/net /http/persistent/ssl_reuse.rb:29:in `connect’ from C:/Ruby193/lib/ruby/1.9.1/net/http.rb:755:in `do_start’ from […]

使用www :: mechanize时的Iconv :: IllegalSequence

我正在尝试做一些webscraping,但WWW:Mechanize gem似乎不喜欢编码和崩溃。 post请求导致302重定向(跟随机械化,到目前为止很好),结果页面似乎崩溃了。 我google了很多,但到目前为止没有任何问题可以解决这个问题。 你们中有人有个主意吗? 码: require ‘rubygems’ require ‘mechanize’ agent = WWW::Mechanize.new agent.user_agent_alias = ‘Mac Safari’ answer = agent.post(‘https://www.budget.de/de/reservierung/privatkunden/step1/schnellbuchung’, {“Country” => “Deutschland”, “Abholstation” => “Aalen”, “Abgabestation” => “Aalen”, “Abholdatum” => “26.02.2009”, “Abholzeit_stunde” => “13”, “Abholzeit_minute” => “30”, “Abgabedatum” => “28.02.2009”, “Abgabezeit_stunde” => “13”, “Abgabezeit_minute” => “30”, “CountryID” => “DE”, “AbholstationID”=>”AA1”, “AbgabestationID”=>”AA1” } ) puts […]

我无法从Nokogiri解析的字符串中删除空格

我无法从字符串中删除空格。 我的HTML是: Cena pro Vás: 139 Kč 我的代码是: #encoding: utf-8 require ‘rubygems’ require ‘mechanize’ agent = Mechanize.new site = agent.get(“http://www.astratex.cz/podlozky-pod-raminka/doplnky”) price = site.search(“//p[@class=’your-price’]/strong/text()”) val = price.first.text => “139 ” val.strip => “139 ” val.gsub(” “, “”) => “139 ” gsub , strip等不起作用。 为什么,我该如何解决这个问题? val.class => String val.dump => “\”139\\u{a0}\”” ! val.encoding => # __ENCODING__ => # […]