Skip to content

Commit a691da7

Browse files
committed
Add doc section for multi-wiki dumps
Change-Id: I20202ef705c74d38e367ad62ec2e573d6cbd5286
1 parent c66aa77 commit a691da7

File tree

1 file changed

+24
-1
lines changed

1 file changed

+24
-1
lines changed

README.md

Lines changed: 24 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ Parsoid HTML dump script for RESTBase APIs like https://rest.wikimedia.org/.
55

66
`npm install`
77

8-
## Usage
8+
## Usage: Dumping a single wikis
99

1010
```
1111
Usage: node ./bin/dump_wiki
@@ -62,3 +62,26 @@ REATE TABLE data(
6262
PRIMARY KEY(title ASC, revision DESC)
6363
);
6464
```
65+
66+
## Usage: dumping all restbase wikis
67+
68+
You need to install `pixz`, which is used for parallel lzma / xz compression:
69+
70+
`apt-get install pixz`
71+
72+
With this in place, follow the instructions:
73+
74+
```bash
75+
# node bin/dump_restbase --help
76+
77+
Create HTML dumps in a directoy
78+
79+
Example usage:
80+
node ./bin/dump_restbase --workDir /tmp --dumpDir /tmp
81+
82+
Options:
83+
-h, --help Show help and exit.
84+
-v, --verbose Verbose logging
85+
--workDir Directory to use for in-progress dump files [default: "/tmp"]
86+
--dumpDir Directory to use for finished dump files [default: "/tmp"]
87+
```

0 commit comments

Comments
 (0)