newly

A DSL for scraping news, given a feed definition made of a URL and CSS selectors.

SYNOPSIS:

require 'newly'

# Fetching breaking news from some website
my_feed = Newly::Feed.new(
  container: '#ultimas-regiao div, #ultimas-regiao ul li',
  href: 'a',
  title: '.titulo',
  subtitle: '.subtitulo',
  image_source: 'img'
)

news = Newly::NewsCrawler.new(url: 'http://g1.globo.com/bahia/', feed: my_feed).fetch

Getting data:

news.each do |n|
  puts n.url # news href url
  puts n.title # news title
  puts n.subtitle # news subtitle
  puts n.image # news image src
end

Or, to get each item as a hash:

news.each do |n|
  puts n.to_hash # { url: '...', title: '...' }
end
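
Because each item can be turned into a hash, the results are easy to serialize. The following is a minimal sketch, assuming to_hash returns a flat hash with symbol keys as the comment above suggests. StubNews and news_to_json are hypothetical names that are not part of newly; StubNews only stands in for the objects returned by fetch so the example runs without network access.

```ruby
require 'json'

# Hypothetical stand-in for the items returned by Newly::NewsCrawler#fetch.
# It is NOT part of newly; it only mimics the accessors shown above.
StubNews = Struct.new(:url, :title, :subtitle, :image) do
  def to_hash
    to_h
  end
end

news = [
  StubNews.new('http://example.com/a', 'First headline', 'First subtitle', 'http://example.com/a.png'),
  StubNews.new('http://example.com/b', 'Second headline', nil, nil)
]

# Convert every item to a hash, drop fields that are nil, and dump as JSON.
def news_to_json(items)
  JSON.generate(items.map { |n| n.to_hash.reject { |_, v| v.nil? } })
end

puts news_to_json(news)
```

With real data, the array returned by fetch would presumably be passed to news_to_json in place of the stub list.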

Contributing to newly