
GitHub mirror of "analytics/limn-wikidata-data". Our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing).


wikimedia/analytics-limn-wikidata-data


analytics/limn/wikidata-data

This repository actually has nothing to do with limn.

It instead contains a bunch of scripts collecting data for the Wikidata dashboards currently in Grafana.

All scripts in this repo have deliberately been written with NO external dependencies or libraries, so that deploying them (potentially on the WMF cluster, for example) is super easy.

Configuration

Some of the social scripts require configuration settings to work.

These should be stored in a file called 'config' in the root of this repo.

The file should look something like the below:

facebook someHashKeyThing1
google someHashKeyThing2
mm-wikidata-pass password1
mm-wikidatatech-pass password2
mm-user foo@bar.baz
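
For illustration, here is a minimal sketch of how a shell script could look up one setting from a file in the format above. The `config_get` helper is hypothetical and not part of this repo. It assumes one "key value" pair per line, separated by whitespace, and values that contain no spaces.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): print the value stored for a
# key in a "key value" file. Assumes values contain no whitespace.
# Usage: config_get KEY [FILE]   (FILE defaults to ./config)
config_get() {
    key=$1
    file=${2:-config}
    # Compare the first whitespace-separated field with the key and print the
    # second field of the first match; exit non-zero if the key is absent.
    awk -v k="$key" '
        $1 == k { print $2; found = 1; exit }
        END { if (!found) exit 1 }
    ' "$file"
}

# Example, commented out so that sourcing this file has no side effects:
# user=$(config_get mm-user) || echo "mm-user is not configured" >&2
```

Using only `sh` and `awk` keeps the sketch in line with the no-dependencies rule above.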

Running the scripts

These scripts should be triggered from a cron that looks something like this:

MAILTO=cron@domainname.org

# Run minutely
* * * * * ~/wikidata-data/minutely.sh

# Daily
0 3 * * * ~/wikidata-data/daily_datamodel.sh
0 4 * * * ~/wikidata-data/graphite/entityUsage.php
0 5 * * * ~/wikidata-data/daily_social.sh
30 5 * * * ~/wikidata-data/daily_misc.sh
0 6 * * * ~/wikidata-data/daily_site_stats.sh

# Logrotate is at 6:25, + time for rsync (hourly?), 12 gives us roughly 6 hours
# This MUST be run on stat1002
0 12 * * * ~/wikidata-data/graphite/api/logScanner.php
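
Because the log scanner must run on one specific host, a wrapper could refuse to start anywhere else. The following is only a sketch: `require_host` is a hypothetical helper, and it assumes that the short hostname reported by `hostname -s` is exactly `stat1002`.

```shell
#!/bin/sh
# Hypothetical guard (not part of this repo): succeed only when running on
# the expected host. The second argument overrides the detected hostname,
# which is mainly useful for testing.
# Usage: require_host EXPECTED [ACTUAL]
require_host() {
    expected=$1
    actual=${2:-$(hostname -s)}
    if [ "$actual" != "$expected" ]; then
        echo "refusing to run: on $actual, expected $expected" >&2
        return 1
    fi
}

# Example, commented out so that sourcing this file has no side effects:
# require_host stat1002 || exit 1
# ~/wikidata-data/graphite/api/logScanner.php
```

With `MAILTO` set as in the crontab above, the message written to stderr would end up in the cron mail.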

Graphite

Metrics are currently stored in the following paths in graphite:

wikidata.*
daily.wikidata.*
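
This README does not say how the scripts submit their values. As a hedged illustration only, Graphite's plaintext protocol accepts lines of the form "path value timestamp", and the sketch below builds such a line. The `graphite_line` helper, the metric name and the host are all hypothetical.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): format one metric as a line of
# Graphite's plaintext protocol, "<path> <value> <unix timestamp>".
# Usage: graphite_line PATH VALUE [TIMESTAMP]   (TIMESTAMP defaults to now)
graphite_line() {
    path=$1
    value=$2
    timestamp=${3:-$(date +%s)}
    printf '%s %s %s\n' "$path" "$value" "$timestamp"
}

# Example, commented out so that sourcing this file sends nothing. The host
# and metric name are made up; 2003 is carbon's default plaintext port, and
# some nc variants need an extra flag to close the connection after sending.
# graphite_line daily.wikidata.example.count 42 | nc graphite.example.org 2003
```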
