通过rails中的链接获取标题，内容

我刚开始学习rails。你能帮我理解解析一个链接吗？好的教程也会有所帮助……

问题是：

当您在Digg，Facebook等中提交链接时。在您说附加链接后，它会解析链接以获取特定url的标题，内容和图像。你能帮我解决一下类似的东西如何在rails中实现吗？

我看过饲料解析器，如feedzirra等，但他们似乎得到了完整的网站提供..不仅仅是我们正在寻找的链接..还是我在某处犯了错误？

非常感谢提前。

看起来你可能正在寻找像Pismo这样的东西： https ： //github.com/peterc/pismo

require 'pismo' # Load a Web page (you could pass an IO object or a string with existing HTML data along, as you prefer) doc = Pismo::Document.new('http://www.rubyinside.com/cramp-asychronous-event-driven-ruby-web-app-framework-2928.html') doc.title # => "Cramp: Asychronous Event-Driven Ruby Web App Framework" doc.author # => "Peter Cooper" doc.lede # => "Cramp (GitHub repo) is a new, asynchronous evented Web app framework by Pratik Naik of 37signals (and the Rails core team). It's built around Ruby's EventMachine library and was designed to use event-driven I/O throughout - making it ideal for situations where you need to handle a large number of open connections (such as Comet systems or streaming APIs.)" doc.keywords # => [["cramp", 7], ["controllers", 3], ["app", 3], ["basic", 2], ..., ... ]

图像警告是：

图像提取仅处理具有绝对URL的图像

ootoovak的答案是正确的，但我更喜欢使用mechanize它的替代方案。使用mechanize这对你有用：

 agent=Mechanize.new # Creates a new Mechanize Object agent.get("http://domain.de/page.html") # This fetches the page given as parameter agent.page.title # This will return the title of the page

要安装mechanize，只需在您的Gemfile添加Gemfile gem 'mechanize'并运行bundle install 。

 > Mechanize.new.get('http://google.com').title => "Google"

确保你require 'mechanize'或在你的Gemfile中添加gem’mechanize gem 'mechanize' 。

通过rails中的链接获取标题，内容

如何在中间件中设置一个可在我的所有应用程序中访问的变量？

为什么这个表达式会导致浮点错误？

在attr_accessor期间键入强制转换值

为什么OpenURI将10kb以下的文件视为StringIO？

如何用哈希值对数组求和

如何使用HTTParty实现此POST请求？

在Windows 8.1上安装json时出错

自动加载常量时检测到循环依赖性

转储YAML时如何强制使用双引号？

Sass – 安装时出错