mirror of
https://github.com/felipecsl/wombat.git
synced 2026-06-17 08:56:44 -07:00
No results
23
Changelog
Felipe Lima edited this page 2015-03-04 23:33:16 -08:00
version 2.4.0
- Updates gem dependencies
- Adds
user_agentanduser_agent_aliasconfig options toWombat.configure
version 2.3.0
- Updates gem dependencies
- Adds content-type=text/html header to Mechanize if missing
- Retry page.click on relative links
version 2.2.1
- Adds ability to crawl a prefetched Mechanize page (thanks to @dsjbirch)
version 2.1.2
- Added support for hash based property selectors (eg.:
css: 'header'instead of'css=.header')
version 2.1.1
- Updated gem dependencies
version 2.1.0
- Added header properties (thanks to @kdridi)
- Fixed bug in selectors that used XPath functions like
concat(thanks to @viniciusdaniel)
version 2.0.1
- Added proxy settings configuration (thanks to @phortx)
- Fixed minor bug in HTML property locator
version 2.0.0
This version contains some breaking changes (not backwards compatible), most notably to for_each that is now specified through the option :iterator and nested block parameters that are gone.
- Added syntatic sugar methods
Wombat.scrapeandCrawler#scrapethat alias to their respectivecrawlmethod implementation; - Gem internals suffered big refactoring, removed code duplication;
- DSL syntax simplified for nested properties. Now the nested block takes no arguments;
- DSL syntax changed for iterated properties. Iterators can now be named just like other properties and won't be automatically named as
iterator#{i}anymore. Specified through the:iteratoroption; Crawler#list_pageis now calledCrawler#path;- Added new
:followproperty type that crawls links in pages.
version 1.0.0
- Breaking change:
Metadata#formatrenamed toMetadata#document_formatdue to method name clash with Kernel#format
version 0.5.0
- Fixed a bug on malformed selectors
- Fixed a bug where multiple calls to #crawl would not clean up previously iterated array results and yield repeated results
version 0.4.0
- Added utility method
Wombat.crawlthat eliminates the need to have a ruby class instance to use Wombat. Now you can use justWombat.crawland start working. The class based format still works as before though.
version 0.3.1
- Added the ability to provide a block to Crawler#crawl and override the default crawler properties for a one off run (thanks to @danielnc)