hifi-decode: Head Switching Filtering, IF to Audio resampling improvement by eshaz · Pull Request #187 · oyvindln/vhs-decode

eshaz · 2025-02-02T03:33:20Z

De-noise head switching pulses
- The audio is high pass filtered to remove audible sound and sent to scipy.signal.find_peaks to detect the peaks that fit closely to the video system field rate.
- Peaks are then smoothed using linear interpolation.
IF to audio resampling now uses sinc-fastest, sinc-medium for resampling quality medium and high options respectfully. This also removes the IF to audio low-pass filter, which is not needed with sinc resampling.
- The better resampling method removes artifacts and noise that was audible in some cases.
- On my machine, the decode is actually faster since the low-pass filter was removed.
- It also helps with the head switching detection to have a tighter impulse response on the pulses so they are more easily detected and interpolated
Overlap blocks in the beginning to avoid periodic clicks when each block is appended to the audio.
Improve multi-threading by offloading soundfile encoding to it's own process and various audio processing tasks to the process pool.
Fix file extension handling.

Head Switching Noise Reduction example:

Top: Without head switching noise reduction
Bottom: With head switching noise reduction

Head Switching peak detection debugging graph

eshaz · 2025-02-04T05:32:42Z

Running a few tests after this latest multi threading update. With this latest update, I am able to get .70x decode rate on a computer that could only do .35x before.

…tion, tune expander params

…es, put noise reduction into processes

eshaz · 2025-02-05T07:08:09Z

Tested thoroughly and everything is working well.

VideoMem · 2025-02-06T00:02:42Z

I had no luck running this PR first try.

There seems to be an error here, in main.py

  with as_soundfile(input_file) as f:
        progressB = TimeProgressBar(f.frames, f.frames)
        try:
            print(f"Starting decode...")
            for block, is_last_block in f.blocks(blocksize=block_size, overlap=read_overlap):
                if exit_requested:
                    break

f.blocks does not return two values, at least in my soundfile version API.

Got a workaround with this:

           for block in f.blocks(blocksize=block_size, overlap=read_overlap):
                is_last_block = f.tell() == f.frames
                if exit_requested:
                    break

In my system it runs at 0.09x, considerably slower than before.
It decodes nice sound.

The stop button in the GUI freezes the app, I don't know if the pause works yet.

There is some EOFError kind of exceptions written in the console at file ending and the application doesn't ends.
Maybe the EOFError is not being handled properly when the input is a file.

What soundfile version are you using?

eshaz · 2025-02-06T01:30:39Z

I updated the f.blocks call in the latest commit to fix an issue where piped data wasn't saving the last input, but forgot to double check the soundfile output. I'll get that fixed.

I have version 0.12.1 of soundfile installed. I believe it is the default version that comes in Ubuntu 24.04. I have noticed that the soundfile decoding of flac is really slow, and this might be causing the extra slowness for you. Soundfile decoding was recently added in #186, and before it was using ffmpeg, which decodes much faster, but has bugs for some people. I have been using flac and ffmpeg to pipe the decoded pcm through stdin, and this is much faster and I have never seen an issue doing this. I have an almost 20 year old Core 2 Duo machine that decodes at around 0.04x, but this is with using piped in data from flac. On my laptop, I get around 0.90x, and on an older server with lots of cores I'm getting around 0.97x.

Here's the command I'm using to test with:
flac -d -c --force-raw-format --endian little --sign signed hifi_rf.flac | ~/git/vhs-decode/hifi-decode -t 16 -n -f 20MHz --overwrite - hifi_rf_decoded.flac

I might be able to put the soundfile code in a thread or have it work asynchronously to try to speed it up, but it probably won't ever be as fast as using flac or ffmpeg

Clicking stop on the gui should clear out the decode queue and then gracefully stop. There is a delay before it actually stops while the queue is being cleared. I can update it so stop just immediately ends decoding, if that's what we want it to do.

All of the machines I own run Linux. I used the built in Python threading libraries that should be OS agnostic, but it's not impossible something is different on Windows or Mac.

eshaz · 2025-02-06T05:03:33Z

@VideoMem I fixed the blocks issue, the gui, and the EOF exceptions.

I put the block reading into an async task, which really helps keep the decoders working when piping in from stdin. It might help soundfile, but I suspect that it's CPU bound.

I also moved the sound player into a process so the preview mode has less skips now. I can almost get real-time playback on my laptop.

VideoMem · 2025-02-06T23:18:44Z

@eshaz
I found a minor issue ( it might be my numba version (0.59.0) ) when piping data from flac as you described.
Something about np.size() not supported.

I simply changed it to len() on @njit decorated functions and it started working.

Yes, there is some performance issues with soundfile, but by piping it goes a lot faster.
It overheated my laptop!

        # offset array copying logic implemented without numpy seems a bit faster
        out_int16_len = np.size(out_int16)
        overlap_size = min(overlap_size, out_int16_len)
        new_overlap = np.empty(overlap_size, dtype=np.int16)
        result = np.empty(overlap_size + out_int16_len, dtype=np.float64)
        
        # copy the overlapping data into result
        overlap_offset = out_int16_len - overlap_size
        if np.size(overlap_data) == 0:
            for i in range(overlap_size):
                overlap_value = out_int16[i + overlap_offset]
                
                new_overlap[i] = overlap_value
                result[i] = overlap_value

In main.py, changed it to:

        # offset array copying logic implemented without numpy seems a bit faster
        out_int16_len = len(out_int16)
        overlap_size = min(overlap_size, out_int16_len)
        new_overlap = np.empty(overlap_size, dtype=np.int16)
        result = np.empty(overlap_size + out_int16_len, dtype=np.float64)
        
        # copy the overlapping data into result
        overlap_offset = out_int16_len - overlap_size
        if len(overlap_data) == 0:
            for i in range(overlap_size):
                overlap_value = out_int16[i + overlap_offset]
                
                new_overlap[i] = overlap_value
                result[i] = overlap_value

All the other things I tested seems to be working.
The sound quality is good.

With that minor change it's ready for prime time.

eshaz · 2025-02-07T05:53:29Z

@VideoMem Fixed the np.size() issue and it works just fine using len().

I also noticed that some of the audio processing was unintentionally using double precision floating points. I updated these to use float32, and this sped things up considerably. I'm able to get 1.13x decode rate on my laptop now. There's also a constant defined that sets the precision in HiFiDecode.py if anyone wants to change it back to float64.

eshaz marked this pull request as ready for review February 2, 2025 06:44

eshaz marked this pull request as draft February 3, 2025 00:37

eshaz marked this pull request as ready for review February 3, 2025 02:43

eshaz marked this pull request as draft February 4, 2025 04:55

eshaz added 14 commits February 3, 2025 23:32

refactor sample rate conditions for consistency

2b3bb97

trim start and end to avoid artifacts between blocks

e2dbb62

implement headswitching peak detection and interpolation

502f4ea

use sinc resampling for medium and best, add 2 pass head switch detec…

b1007db

…tion, tune expander params

fix file extension handling

f7ce3a1

remove unused blocknum

4f1e361

remove current block parameter

ba2bec0

put expander values back where they were

421d6b7

fix interpolation bounds error by extrapolating

2e47c6b

better headswitching pulse width detection, smooth interpolation

393e9d1

add numba to post processor for better performance

6fcf37b

refactor head switching noise reduction

78cd6c1

trim out pulses at edges blocks, was causing issues with the expander

57e78cb

better logging, more efficient multiprocessing

ac9bc0d

eshaz force-pushed the hifi-decode-head-switch-filter branch from 9abd7db to ac9bc0d Compare February 4, 2025 05:34

eshaz added 8 commits February 4, 2025 10:30

faster input data copying

ec24804

use min overlap

d6a3563

don't wait on soundfile encode

2630207

create stateful processes for hifidecode decoders

fd965df

refactor multiprocessing code, move resampling to hifi decode process…

cfff0d4

…es, put noise reduction into processes

numba performance improvements, fix reentrant call on ctrl-c

d75cb40

use put_nowait rather than threads

692d9a9

throttle number of queued decoders

429a0a8

eshaz marked this pull request as ready for review February 5, 2025 07:08

handle last block in single threaded decode

619a1e6

eshaz added 5 commits February 5, 2025 21:08

make soundfile and stdin reads async, fix missing parameter

acd88cd

fix gui

dee44a5

remove linear resampling

0ef5eed

move sound player into process

b61001b

fix indentation

5adcd9b

clean up threads on exit, handle last block

e55edb3

harrypm added the Audio Related to HiFi-Decode or Audio Only Tapes label Feb 6, 2025

eshaz added 2 commits February 6, 2025 01:26

fix last block indication

945eb96

handle end of generator

3ade84d

eshaz added 2 commits February 6, 2025 23:21

use float32 where possible, fix numpy.size() issue

ca7f57b

use float32 when not resampling

52c33e9

VideoMem merged commit 56ca756 into oyvindln:vhs_decode Feb 7, 2025

eshaz deleted the hifi-decode-head-switch-filter branch March 8, 2025 16:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hifi-decode: Head Switching Filtering, IF to Audio resampling improvement#187

hifi-decode: Head Switching Filtering, IF to Audio resampling improvement#187
VideoMem merged 33 commits intooyvindln:vhs_decodefrom
eshaz:hifi-decode-head-switch-filter

eshaz commented Feb 2, 2025 •

edited

Loading

Uh oh!

eshaz commented Feb 4, 2025

Uh oh!

eshaz commented Feb 5, 2025

Uh oh!

VideoMem commented Feb 6, 2025

Uh oh!

eshaz commented Feb 6, 2025

Uh oh!

eshaz commented Feb 6, 2025

Uh oh!

VideoMem commented Feb 6, 2025

Uh oh!

eshaz commented Feb 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

eshaz commented Feb 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Head Switching Noise Reduction example:

Head Switching peak detection debugging graph

Uh oh!

eshaz commented Feb 4, 2025

Uh oh!

eshaz commented Feb 5, 2025

Uh oh!

VideoMem commented Feb 6, 2025

Uh oh!

eshaz commented Feb 6, 2025

Uh oh!

eshaz commented Feb 6, 2025

Uh oh!

VideoMem commented Feb 6, 2025

Uh oh!

eshaz commented Feb 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

eshaz commented Feb 2, 2025 •

edited

Loading