Tag: scraping scrapyd

in `escape': undefined method `gsub' for # (NoMethodError)

Hi, I want to scrape a web page, grab a link from it, follow that link, and scrape the linked page as well.

    require 'rubygems'
    require 'scrapi'
    require 'uri'

    Scraper::Base.parser :html_parser

    web = "http://……"

    def sub_web(linksubweb)
      uri = URI.parse(URI.encode(linksubweb))
    end

    scraper = Scraper.define do
      array :items
      process "div.mozaique>div", :items => Scraper.define {
        process "p>a", :title => :text
        process "div.thumb>a", :link => "@href"
        result :title, :link,
      }
      result :items
    end

    uri = URI.parse(URI.encode(web))
    scraper.scrape(uri).each do |pag|
      link_full = […]
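The excerpt is cut off before the failing call, so the exact cause cannot be confirmed from it. One common way to hit this kind of NoMethodError is handing a URI object, rather than a String, to an escaping routine that calls `gsub` internally. The sketch below illustrates that under stated assumptions: `escape_spaces` is a hypothetical stand-in for such an escaper (it is not scrapi's or the `uri` library's own code), and `http://example.com/a` is a placeholder URL.

```ruby
require 'uri'

# Hypothetical stand-in for a gsub-based escaper (an assumption for
# illustration): it only works when given a String.
def escape_spaces(str)
  str.gsub(' ', '%20')
end

# Placeholder URL; URI.parse returns a URI::HTTP object, not a String.
link = URI.parse('http://example.com/a')

# Passing the URI object fails, because URI::HTTP has no gsub method.
error_message = nil
begin
  escape_spaces(link)
rescue NoMethodError => e
  error_message = e.message
end

# Converting to a String first avoids the error.
from_uri = escape_spaces(link.to_s)

# A plain String with a space is escaped as expected.
from_string = escape_spaces('http://example.com/a b')
```

If this is what happens in the question's code, calling `to_s` on the value before it reaches the escaper (or escaping once, before parsing) would be one way out.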