coverage.md

What testing coverage approaches are needed?

Dependencies: version control (check commit ID dates)
see: requirements.txt
run: find *txt . | grep extra
Docstring tests: commented function signatures
(of functions intended for outer calls)
Unittests per function-critical code block
Integration tests for vanilla experiments to cover use-cases on a generic task basis
Regression testing: standing interfaces & their refactoring
Linters for automated style checks & corrections of python & yaml code

Where to get things done?

Raise your questions & engage in Discussions
Report a bug or request a feature, open Issues
Contribute Pull requests
Release pretrained models through SpeechBrain
e.g. registering linking HuggingFace account to SpeechBrain for hosting your model card

GitHub workflow: strategy by configuration

API configurations are located at .github/workflows
(all creating a one-time ubuntu-latest environment)

Info: although our PyTorch requirements are

torch>=1.9.0
torchaudio>=0.9.0

our tests cover one PyTorch version only, the latest.

pre-commit.yml

SpeechBrain pre-commit / pre-commit (pull_request)

python-version: '3.12'
run pre-commit action, configured in .pre-commit-config.yaml
- hook: https://github.com/pre-commit/pre-commit-hooks
  trailing-whitespace
  end-of-file-fixer
  requirements-txt-fixer
  mixed-line-ending
  check-added-large-files
- hook: https://github.com/psf/black
  black
  click
- hook: https://github.com/astral-sh/ruff-pre-commit
  ruff; see: pyproject.toml for configuration
- hook: https://github.com/adrienverge/yamllint
  yamllint; see: .yamllint.yaml

pythonapp.yml

SpeechBrain toolkit CI / Tests (3.10) (pull_request)
SpeechBrain toolkit CI / Tests (3.13) (pull_request)

python-version: ["3.10", 3.13]

create fresh environment

sudo apt-get install -y libsndfile1
pip install -r requirements.txt
pip install --editable .
pip install ctc-segmentation

run PyTest checks
see: pytest.ini - files: test_*.py; check_*.py; example_*.py & norecursedirs
see: conftest.py - prepare test item collection & direct discovery
```
# excerpts
parser.addoption("--device", action="store", default="cpu")
...
```
- a. hook: Consistency tests with pytest
  pytest tests/consistency
- b. hook: Unittests with pytest
  pytest tests/unittests
- c. hook: Doctests with pytest
  pytest --doctest-modules speechbrain
- d. hook: Integration tests with pytest
  pytest tests/integration

verify-docs-gen.yml [I.2.a]

Verify docs generation / docs (pull_request)

python-version: '3.12'

create fresh environment

pip install -r requirements.txt
pip install --editable .
pip install -r docs/docs-requirements.txt

generates docs
```
cd docs
make html
```
compare: .readthedocs.yaml - python version: 3.8

newtag.yml

Draft release when pushing new tag

tagging of develop branch commit ID
before
- follow through tests/PRE-RELEASE-TESTS.md
  - set-up fresh environment
  - run pytest
  - a. hook: tests/.run-load-yaml-tests.sh
  - b. hook: tests/.run-recipe-tests.sh
  - c. hook: tests/.run-HF-checks.sh
  - d. hook: ests/.run-url-checks.sh
- update of speechbrain/version.txt to the next
action: draft push to main branch
implies pre-push hook, see: .pre-push-config.yaml with hooks to:
- e. tests/.run-linters.sh
- f. tests/.run-unittests.sh
- g. tests/.run-doctests.sh

release.yml

Publish to PyPI

python-version: 3.12
action: checkout to main branch
creates: pypa/build for binary wheel and source tarball
action: Publish to PyPI via pypa/gh-action-pypi-publish@master
implies use of
- LICENSE
- README.md
- pyproject.toml - target-version = ['py38']
  - python_requires=">=3.8.1",
  - uses: speechbrain/version.txt
  - requires:
```
 "hyperpyyaml",
 "joblib",
 "numpy",
 "packaging",
 "scipy",
 "sentencepiece",
 "torch>=1.9",
 "torchaudio",
 "tqdm",
 "huggingface_hub",
```
  - points to https://speechbrain.github.io/

The versions of tools used/hooked in these checks are controlled via lint-requirements.txt, a nested dependency in requirements.txt. With major version releases of SpeechBrain, the versions of each hook should be updated—alongside requirement consistency in source, testing & builds incl. running spell-checking.

PyTest for reporting code coverage rates

How to know test coverage changes of Open PRs to be merged?
(snippet for cpu-only)

# Example: install more dependencies to avoid ignoring modules
sudo apt install -y libsndfile1
pip install ctc_segmentation

# install coverage
pip install pytest-cov

# run the test (w/ duration reporting)
pytest --durations=0 --cov=speechbrain --cov-context=test --doctest-modules speechbrain tests

Example: After collecting 459 testing items, 4481/16782 statements are reported "missing" (73% coverage).

YET—python code of the core modules is not all to be covered; thus far, only, consistency is ensured..

Further reading:
pytest & coverage - https://breadcrumbscollector.tech/how-to-use-code-coverage-in-python-with-pytest/ (pointer by @Adel-Moumen)

pytest --durations=0 --cov=speechbrain --cov-context=test --doctest-modules speechbrain tests

---------- coverage: platform linux, python 3.9.12-final-0 -----------
Name                                                      Stmts   Miss  Cover
-----------------------------------------------------------------------------
speechbrain/alignment/aligner.py                            380     61    84%
speechbrain/alignment/ctc_segmentation.py                   189     10    95%
speechbrain/core.py                                         424    155    63% <== < 80%
speechbrain/dataio/batch.py                                  99      8    92%
speechbrain/dataio/dataio.py                                279     50    82%
speechbrain/dataio/dataloader.py                            140     25    82%
speechbrain/dataio/dataset.py                               100      8    92%
speechbrain/dataio/encoder.py                               328     46    86%
speechbrain/dataio/iterators.py                              80     62    22% <== < 80%
speechbrain/dataio/legacy.py                                121     41    66% <== < 80%
speechbrain/dataio/preprocess.py                             22      4    82%
speechbrain/dataio/sampler.py                               224     61    73% <== < 80%
speechbrain/dataio/wer.py                                    63     54    14% <== < 80%
speechbrain/decoders/ctc.py                                 111     89    20% <== < 80%
speechbrain/decoders/seq2seq.py                             370     46    88%
speechbrain/decoders/transducer.py                          133     64    52% <== < 80%
speechbrain/lm/arpa.py                                       77      3    96%
speechbrain/lm/counting.py                                   37      4    89%
speechbrain/lm/ngram.py                                      36      1    97%
speechbrain/lobes/augment.py                                154     55    64% <== < 80%
speechbrain/lobes/beamform_multimic.py                       20     14    30% <== < 80%
speechbrain/lobes/features.py                                96      9    91%
speechbrain/lobes/models/CRDNN.py                            52     12    77% <== < 80% Ruff #29
speechbrain/lobes/models/transformer/Transformer.py         180     22    88%
speechbrain/lobes/models/transformer/TransformerASR.py       92     28    70% <== < 80%
speechbrain/lobes/models/transformer/TransformerLM.py        47      5    89%
speechbrain/lobes/models/transformer/TransformerSE.py        20      2    90%
speechbrain/lobes/models/transformer/TransformerST.py        81     60    26% <== < 80%
speechbrain/lobes/models/wav2vec.py                         123     55    55% <== < 80%
speechbrain/nnet/CNN.py                                     417     56    87%
speechbrain/nnet/RNN.py                                     471     51    89%
speechbrain/nnet/activations.py                              39      1    97%
speechbrain/nnet/attention.py                               234     44    81%
speechbrain/nnet/complex_networks/c_CNN.py                  130     23    82%
speechbrain/nnet/complex_networks/c_RNN.py                  374     67    82%
speechbrain/nnet/complex_networks/c_normalization.py        277     68    75% <== < 80%
speechbrain/nnet/complex_networks/c_ops.py                  108     40    63% <== < 80%
speechbrain/nnet/containers.py                              139     14    90%
speechbrain/nnet/linear.py                                   27      1    96%
speechbrain/nnet/loss/si_snr_loss.py                         20     16    20% <== < 80%
speechbrain/nnet/loss/stoi_loss.py                           81      1    99%
speechbrain/nnet/losses.py                                  323    112    65% <== < 80%
speechbrain/nnet/normalization.py                           142      6    96%
speechbrain/nnet/pooling.py                                 156     31    80%
speechbrain/nnet/quantisers.py                               47      2    96%
speechbrain/nnet/quaternion_networks/q_CNN.py               150     25    83%
speechbrain/nnet/quaternion_networks/q_RNN.py               370     59    84%
speechbrain/nnet/quaternion_networks/q_linear.py             50     11    78% <== < 80%
speechbrain/nnet/quaternion_networks/q_normalization.py      44      4    91%
speechbrain/nnet/quaternion_networks/q_ops.py               229    122    47% <== < 80%
speechbrain/nnet/schedulers.py                              363    103    72% <== < 80%
speechbrain/nnet/transducer/transducer_joint.py              33      5    85%
speechbrain/pretrained/fetching.py                           48      6    88%
speechbrain/pretrained/interfaces.py                        786    338    57% <== < 80%
speechbrain/pretrained/training.py                           33     28    15% <== < 80%
speechbrain/processing/PLDA_LDA.py                          345     96    72% <== < 80%
speechbrain/processing/decomposition.py                     102      8    92%
speechbrain/processing/diarization.py                       319    157    51% <== < 80%
speechbrain/processing/features.py                          359     75    79% <== < 80%
speechbrain/processing/multi_mic.py                         345      2    99%
speechbrain/processing/signal_processing.py                 166     39    77% <== < 80%
speechbrain/processing/speech_augmentation.py               386     34    91%
speechbrain/tokenizers/SentencePiece.py                     181     74    59% <== < 80%
speechbrain/utils/Accuracy.py                                24     17    29% <== < 80%
speechbrain/utils/DER.py                                     44     33    25% <== < 80%
speechbrain/utils/bleu.py                                    50     43    14% <== < 80%
speechbrain/utils/callchains.py                              28      5    82%
speechbrain/utils/checkpoints.py                            294     52    82%
speechbrain/utils/data_pipeline.py                          181     15    92%
speechbrain/utils/data_utils.py                             197     77    61% <== < 80%
speechbrain/utils/depgraph.py                                82      1    99%
speechbrain/utils/distributed.py                             61     37    39% <== < 80%
speechbrain/utils/edit_distance.py                          180     50    72% <== < 80%
speechbrain/utils/epoch_loop.py                              55     22    60% <== < 80%
speechbrain/utils/hparams.py                                  2      1    50% <== < 80%
speechbrain/utils/hpopt.py                                  134     41    69% <== < 80%
speechbrain/utils/logger.py                                  73     45    38% <== < 80%
speechbrain/utils/metric_stats.py                           285     48    83%
speechbrain/utils/parameter_transfer.py                      87     17    80%
speechbrain/utils/profiling.py                              191     54    72% <== < 80%
speechbrain/utils/superpowers.py                             20      6    70% <== < 80%
speechbrain/utils/text_to_sequence.py                        77     22    71% <== < 80%
speechbrain/utils/torch_audio_backend.py                      9      2    78% <== < 80%
speechbrain/utils/train_logger.py                           150    113    25% <== < 80%
speechbrain/wordemb/transformer.py                           90     67    26% <== < 80%
-----------------------------------------------------------------------------
TOTAL                                                     16782   4481    73%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What testing coverage approaches are needed?

Where to get things done?

GitHub workflow: strategy by configuration

pre-commit.yml

pythonapp.yml

verify-docs-gen.yml [I.2.a]

newtag.yml

release.yml

PyTest for reporting code coverage rates

FilesExpand file tree

coverage.md

Latest commit

History

coverage.md

File metadata and controls

What testing coverage approaches are needed?

Where to get things done?

GitHub workflow: strategy by configuration

pre-commit.yml

pythonapp.yml

verify-docs-gen.yml [I.2.a]

newtag.yml

release.yml

PyTest for reporting code coverage rates