Skip to content

ybenjo/crawl_syobocal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 

Repository files navigation

Purpose

Crawl syoboi calendar(cal.syoboi.jp) to collect SEIYU(voice actor/actress)'s information.

Environments

  • ruby >= 1.9
  • gems: nokogiri, mongo
  • Mongodb

Usage

ruby crawl.rb [playing|old|ova|movie|radio|hero|all]
'all' : crawling all data

config.yaml

  • address: address of mongodb.
  • port: port number of mongodb.
  • db: name of mongodb's db.
  • collection: name of mongodb's collection.
  • sleep: crawl interval(sec).

Output

Each mongodb's document contains these values.

  • title: Animation's title.
  • date: Date when this animations start.
  • last_update: Time when this article(in syobocal) updated.
  • url: URL of this animation's article.
  • casts: Array of pairs that contains [character, seiyu].
    • Array is sorted by Calender's order(maybe importance).
  • director: Director who made.
  • studio: Animation studio which made.

About

crawl syobocal(http://cal.syoboi.jp/)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages