New Dynamic Extractors from Wikipedia Content with JSONpedia Faceted Browsing #356

@michelemostarda

Description

  • Clone the latest JSONpedia snapshot release from https://bitbucket.org/hardest/jsonpedia
    and read the documentation.
  • Install Elasticsearch 1.0.1 (later versions have not been tested yet, but you are welcome to try them).
  • Optionally install MongoDB 2.4.9 (later versions have not been tested yet, but you are welcome to try them).
  • Configure JSONpedia (conf/default/properties) to work with your Elasticsearch installation and, if installed, MongoDB.
  • Use JSONpedia to process the first 3 Wikipedia dumps [1] into Elasticsearch and, optionally, MongoDB (either manually or using the script bin/loader.py).
  • Load the faceted indexes using the script bin/facet_loader.py.
  • Run a local JSONpedia service [2] and verify that the faceted browser is working properly.

[1] http://dumps.wikimedia.org/enwiki/latest/
[2] http://localhost:PORT/frontend/facet.html?source={"query":{"match_all":{}},"size":20}
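
The `source` parameter in [2] carries a raw JSON query, which some browsers and HTTP clients will not accept unencoded. As a minimal sketch (not part of JSONpedia), the helper below builds the same URL with the query percent-encoded. The function names `facet_url` and `decode_source`, the default host, and the port used in the usage note are hypothetical; only the path `/frontend/facet.html`, the `source` parameter, and the `match_all` query with `size` 20 come from the link above.

```python
import json
from urllib.parse import urlencode, urlsplit, parse_qs


def facet_url(port, query=None, size=20, host="localhost"):
    """Build the faceted browser URL from [2] with an encoded `source` query.

    `port` is whatever port your local JSONpedia service listens on
    (left as PORT in the issue text, so it must be supplied by the caller).
    """
    source = {"query": query or {"match_all": {}}, "size": size}
    # Compact separators reproduce the whitespace-free JSON shown in [2].
    encoded = urlencode({"source": json.dumps(source, separators=(",", ":"))})
    return f"http://{host}:{port}/frontend/facet.html?{encoded}"


def decode_source(url):
    """Recover the JSON query from a facet URL, e.g. to double-check it."""
    query_string = urlsplit(url).query
    return json.loads(parse_qs(query_string)["source"][0])
```

For example, `facet_url(8080)` (8080 being a placeholder port) yields a URL that can be opened in a browser once the service is running, and `decode_source` returns the original query dictionary from it.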
