-
Notifications
You must be signed in to change notification settings - Fork 102
Open
Description
Hi, I trying to run wikipedia2vec on a Wikipedia dump from 2006. I am not sure if this older version is the cause of the issue, but I get the following error when running:
wikipedia2vec train enwik9.bz2 out_file --dim-size=300 --iteration=10 --negative=15 --min-entity-count=0
[2024-06-18 18:24:57,871] [INFO] Starting to build a Dump DB... (train@cli.py:167)
Traceback (most recent call last):
File "/home/byron/.local/bin/wikipedia2vec", line 8, in <module>
sys.exit(cli())
File "/usr/lib/python3/dist-packages/click/core.py", line 1128, in __call__
return self.main(*args, **kwargs)
File "/usr/lib/python3/dist-packages/click/core.py", line 1053, in main
rv = self.invoke(ctx)
File "/usr/lib/python3/dist-packages/click/core.py", line 1659, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "/usr/lib/python3/dist-packages/click/core.py", line 1395, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/usr/lib/python3/dist-packages/click/core.py", line 754, in invoke
return __callback(*args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 61, in wrapper
return func(*args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 83, in wrapper
return func(*args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 129, in wrapper
return func(*args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 32, in wrapper
return func(*args, **kwargs)
File "/usr/lib/python3/dist-packages/click/decorators.py", line 26, in new_func
return f(get_current_context(), *args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 169, in train
invoke(build_dump_db, out_file=dump_db_file)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 165, in invoke
ctx.invoke(cmd, **cmd_kwargs)
File "/usr/lib/python3/dist-packages/click/core.py", line 754, in invoke
return __callback(*args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 32, in wrapper
return func(*args, **kwargs)
File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 211, in build_dump_db
DumpDB.build(dump_reader, out_file, **kwargs)
File "wikipedia2vec/dump_db.py", line 161, in wikipedia2vec.dump_db.DumpDB.build
File "wikipedia2vec/dump_db.py", line 183, in wikipedia2vec.dump_db.DumpDB.build
File "wikipedia2vec/dump_db.py", line 187, in wikipedia2vec.dump_db.DumpDB.build
File "/usr/lib/python3.10/multiprocessing/pool.py", line 451, in <genexpr>
return (item for chunk in result for item in chunk)
File "/usr/lib/python3.10/multiprocessing/pool.py", line 873, in next
raise value
AttributeError: 'NoneType' object has no attribute 'text'
Metadata
Metadata
Assignees
Labels
No labels