Skip to content

Commit 9d986e9

Browse files
authored
Merge pull request #59 from wikimedia/eu_hy_rebuild
Fixes references to eu and hy vector files and rebuild
2 parents 4ce74cb + 29c5235 commit 9d986e9

11 files changed

+2612
-2613
lines changed

Makefile

Lines changed: 6 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,6 @@ drafttopic_models: \
1515
models/kowiki.drafttopic.gradient_boosting.model \
1616
models/ukwiki.drafttopic.gradient_boosting.model \
1717
models/viwiki.drafttopic.gradient_boosting.model \
18-
models/wikidata.drafttopic.gradient_boosting.model
1918

2019
articletopic_models: \
2120
models/arwiki.articletopic.gradient_boosting.model \
@@ -405,7 +404,7 @@ word2vec/euwiki-20201201-learned_vectors.50_cell.10k.kv:
405404

406405
datasets/euwiki.balanced_article_sample.w_draft_cache.json: \
407406
datasets/euwiki.balanced_article_sample.w_draft_text.json \
408-
word2vec/euwiki-20200501-learned_vectors.50_cell.10k.kv
407+
word2vec/euwiki-20201201-learned_vectors.50_cell.10k.kv
409408
./utility extract_from_text \
410409
drafttopic.feature_lists.euwiki.drafttopic \
411410
--input=$< \
@@ -414,7 +413,7 @@ datasets/euwiki.balanced_article_sample.w_draft_cache.json: \
414413

415414
datasets/euwiki.balanced_article_sample.w_article_cache.json: \
416415
datasets/euwiki.balanced_article_sample.w_article_text.json \
417-
word2vec/euwiki-20200501-learned_vectors.50_cell.10k.kv
416+
word2vec/euwiki-20201201-learned_vectors.50_cell.10k.kv
418417
./utility extract_from_text \
419418
drafttopic.feature_lists.euwiki.articletopic \
420419
--input=$< \
@@ -616,7 +615,7 @@ word2vec/hywiki-20201201-learned_vectors.50_cell.10k.kv:
616615

617616
datasets/hywiki.balanced_article_sample.w_draft_cache.json: \
618617
datasets/hywiki.balanced_article_sample.w_draft_text.json \
619-
word2vec/hywiki-20200501-learned_vectors.50_cell.10k.kv
618+
word2vec/hywiki-20201201-learned_vectors.50_cell.10k.kv
620619
./utility extract_from_text \
621620
drafttopic.feature_lists.hywiki.drafttopic \
622621
--input=$< \
@@ -625,7 +624,7 @@ datasets/hywiki.balanced_article_sample.w_draft_cache.json: \
625624

626625
datasets/hywiki.balanced_article_sample.w_article_cache.json: \
627626
datasets/hywiki.balanced_article_sample.w_article_text.json \
628-
word2vec/hywiki-20200501-learned_vectors.50_cell.10k.kv
627+
word2vec/hywiki-20201201-learned_vectors.50_cell.10k.kv
629628
./utility extract_from_text \
630629
drafttopic.feature_lists.hywiki.articletopic \
631630
--input=$< \
@@ -934,7 +933,7 @@ word2vec/ukwiki-20201201-learned_vectors.50_cell.10k.kv:
934933

935934
datasets/ukwiki.balanced_article_sample.w_draft_cache.json: \
936935
datasets/ukwiki.balanced_article_sample.w_draft_text.json \
937-
word2vec/ukwiki-20200501-learned_vectors.50_cell.10k.kv
936+
word2vec/ukwiki-20201201-learned_vectors.50_cell.10k.kv
938937
./utility extract_from_text \
939938
drafttopic.feature_lists.ukwiki.drafttopic \
940939
--input=$< \
@@ -943,7 +942,7 @@ datasets/ukwiki.balanced_article_sample.w_draft_cache.json: \
943942

944943
datasets/ukwiki.balanced_article_sample.w_article_cache.json: \
945944
datasets/ukwiki.balanced_article_sample.w_article_text.json \
946-
word2vec/ukwiki-20200501-learned_vectors.50_cell.10k.kv
945+
word2vec/ukwiki-20201201-learned_vectors.50_cell.10k.kv
947946
./utility extract_from_text \
948947
drafttopic.feature_lists.ukwiki.articletopic \
949948
--input=$< \

drafttopic/feature_lists/euwiki.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44

55

66
euwiki_kvs = vectorizers.word2vec.load_gensim_kv(
7-
filename="euwiki-20200501-learned_vectors.50_cell.10k.kv", mmap='r')
7+
filename="euwiki-20201201-learned_vectors.50_cell.10k.kv", mmap='r')
88

99

1010
def vectorize_words(words):

drafttopic/feature_lists/hywiki.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44

55

66
hywiki_kvs = vectorizers.word2vec.load_gensim_kv(
7-
filename="hywiki-20200501-learned_vectors.50_cell.10k.kv", mmap='r')
7+
filename="hywiki-20201201-learned_vectors.50_cell.10k.kv", mmap='r')
88

99

1010
def vectorize_words(words):

model_info/euwiki.articletopic.md

Lines changed: 655 additions & 655 deletions
Large diffs are not rendered by default.

model_info/euwiki.drafttopic.md

Lines changed: 609 additions & 609 deletions
Large diffs are not rendered by default.

model_info/hywiki.articletopic.md

Lines changed: 661 additions & 661 deletions
Large diffs are not rendered by default.

model_info/hywiki.drafttopic.md

Lines changed: 671 additions & 671 deletions
Large diffs are not rendered by default.
Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
11
version https://git-lfs.github.com/spec/v1
2-
oid sha256:58496fdfb02c6ea8d2ca225a217db2527c91d9fa1ddce9b5de1ca031f500afb9
3-
size 50012570
2+
oid sha256:fabe5d0742b13e43142e3a74bfab89ad0796204f9bfced4c48e1caad1ae3441b
3+
size 49564248
Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
11
version https://git-lfs.github.com/spec/v1
2-
oid sha256:2eedbe29bc625dc94c56e3251cd6f21dfb0512c7c49f7126bce846c133c6be7b
3-
size 50021491
2+
oid sha256:74a5829d3868745f182bc6332bcb31b2f61aeaf69e56ac67cc5336921466f119
3+
size 50035018
Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
11
version https://git-lfs.github.com/spec/v1
2-
oid sha256:841b3be3c4b1d7819a5097e065acf048af0ec4b585ce26cd316218181df62dcc
3-
size 49667890
2+
oid sha256:a02709f6cc5a7050bd02f075c16cab9b7ee23c7193887b3247fb5a1231e2dc46
3+
size 49819102

0 commit comments

Comments
 (0)