'NoneType' object has no attribute 'text' #85

@byronknoll

Hi, I am trying to run wikipedia2vec on a Wikipedia dump from 2006. I am not sure whether the age of the dump is the cause, but I get the following error when running:

wikipedia2vec train enwik9.bz2 out_file --dim-size=300 --iteration=10 --negative=15 --min-entity-count=0
[2024-06-18 18:24:57,871] [INFO] Starting to build a Dump DB... (train@cli.py:167)
Traceback (most recent call last):
  File "/home/byron/.local/bin/wikipedia2vec", line 8, in <module>
    sys.exit(cli())
  File "/usr/lib/python3/dist-packages/click/core.py", line 1128, in __call__
    return self.main(*args, **kwargs)
  File "/usr/lib/python3/dist-packages/click/core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "/usr/lib/python3/dist-packages/click/core.py", line 1659, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "/usr/lib/python3/dist-packages/click/core.py", line 1395, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/lib/python3/dist-packages/click/core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 61, in wrapper
    return func(*args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 83, in wrapper
    return func(*args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 129, in wrapper
    return func(*args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 32, in wrapper
    return func(*args, **kwargs)
  File "/usr/lib/python3/dist-packages/click/decorators.py", line 26, in new_func
    return f(get_current_context(), *args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 169, in train
    invoke(build_dump_db, out_file=dump_db_file)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 165, in invoke
    ctx.invoke(cmd, **cmd_kwargs)
  File "/usr/lib/python3/dist-packages/click/core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 32, in wrapper
    return func(*args, **kwargs)
  File "/home/byron/.local/lib/python3.10/site-packages/wikipedia2vec/cli.py", line 211, in build_dump_db
    DumpDB.build(dump_reader, out_file, **kwargs)
  File "wikipedia2vec/dump_db.py", line 161, in wikipedia2vec.dump_db.DumpDB.build
  File "wikipedia2vec/dump_db.py", line 183, in wikipedia2vec.dump_db.DumpDB.build
  File "wikipedia2vec/dump_db.py", line 187, in wikipedia2vec.dump_db.DumpDB.build
  File "/usr/lib/python3.10/multiprocessing/pool.py", line 451, in <genexpr>
    return (item for chunk in result for item in chunk)
  File "/usr/lib/python3.10/multiprocessing/pool.py", line 873, in next
    raise value
AttributeError: 'NoneType' object has no attribute 'text'
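
For context, this message is what Python raises when `.text` is read from the result of an XML lookup that found nothing, since `find()` returns `None` for a missing element. The sketch below reproduces that error class with the standard library and shows a guarded lookup. It is only an illustration under assumptions: the page record, the element names (`ns`, `revision/text`), and the `child_text` helper are hypothetical and are not taken from wikipedia2vec's source or from the 2006 dump, so it does not confirm where the failure actually occurs.

```python
import xml.etree.ElementTree as ET

# Hypothetical page record. The element names are illustrative only and are
# not taken from wikipedia2vec's source or from the 2006 dump.
PAGE_XML = (
    "<page><title>Example</title>"
    "<revision><text>body</text></revision></page>"
)


def child_text(page, path, default=None):
    """Return the text of the child at `path`, or `default` if it is absent."""
    elem = page.find(path)
    return default if elem is None else elem.text


page = ET.fromstring(PAGE_XML)

# Unguarded access: find() returns None because <ns> is missing, so reading
# .text raises AttributeError with the same message as in the traceback.
try:
    page.find("ns").text
except AttributeError as exc:
    error_message = str(exc)

# Guarded access: missing elements fall back to a default instead of raising.
title = child_text(page, "title")
namespace = child_text(page, "ns", default="0")
body = child_text(page, "revision/text")

print(error_message)
print(title, namespace, body)
```

If the dump reader does hit a missing element, a guard of this kind (or skipping the affected page) would be one possible workaround, but that would need to be checked against the actual parsing code.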
