drowe67 commented Jan 17, 2025

Bandwidth/PAPR

Exploring ideas to improve the 99% power bandwidth (spectral mask) compared to RADE V1. Just prototyping with "mixed rate" training and inference, i.e. no pilots or CP, genie phase.

  • Worked out how to put a BPF in the training loop (conv1d with training disabled); a sketch follows this list
  • Takeaway: phase-only (0 dB PAPR) works quite well
  • clip-BPF x 3 produces reasonable 99% power BW, 0 dB PAPR, good loss
  • Document ML EQ training and inference in README.md when we get to the final V2 version. Just collect notes here in comments until then
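
A minimal sketch of the frozen-BPF idea, assuming a firwin-designed filter with illustrative band edges (train.py's actual taps and passband may differ):

    import torch
    import torch.nn.functional as F
    from scipy.signal import firwin

    # design fixed BPF taps offline (band edges here are illustrative)
    Fs = 8000
    h = torch.tensor(firwin(101, [400, 2600], fs=Fs, pass_zero=False),
                     dtype=torch.float32).view(1, 1, -1)
    h.requires_grad_(False)  # "training disabled": gradients flow through, taps never update

    def tx_bpf(x):
        # x: (batch, 1, time) real tx samples; padding keeps the length unchanged
        return F.conv1d(x, h, padding=h.shape[-1] // 2)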

Training:

python3 train.py --cuda-visible-devices 0 --sequence-length 400 --batch-size 512 --epochs 200 --lr 0.003 --lr-decay-factor 0.0001 ~/Downloads/tts_speech_16k_speexdsp.f32 250117_test --bottleneck 3 --h_file h_nc20_train_mpp.f32 --range_EbNo --plot_loss --auxdata --txbpf
Epoch 200 Loss 0.116

Testing:

./inference.sh 250117_test/checkpoints/checkpoint_epoch_200.pth wav/brian_g8sez.wav - --bottleneck 3 --auxdata --write_tx tx_bpf.f32 --write_latent z.f32 --txbpf
          Eb/No   C/No     SNR3k  Rb'    Eq     PAPR
Target..: 100.00  133.01   98.24  2000
Measured: 102.89          101.12       1243.47  0.00
loss: 0.121 BER: 0.000

octave:154> radae_plots; do_plots('z.f32','tx_bpf.f32')
bandwidth (Hz): 1255.813953 power/total_power: 0.990037

Red lines mark 99% power bandwidth:

[screenshot 2025-01-22: Tx spectrum with red 99% power bandwidth markers]
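
For reference, one way to estimate the 99% power bandwidth from the tx samples (an illustrative Python sketch; the Octave do_plots code may use a different definition):

    import numpy as np

    def power_bandwidth(x, fs, frac=0.99):
        # crude single-FFT periodogram; take the strongest bins that
        # together hold frac of the total power, report their span
        P = np.abs(np.fft.fft(x))**2
        f = np.fft.fftfreq(len(x), 1/fs)
        order = np.argsort(P)[::-1]
        n = np.searchsorted(np.cumsum(P[order]), frac * P.sum()) + 1
        band = f[order[:n]]
        return band.max() - band.min()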

ML EQ

Classical DSP:

python3 ml_eq.py --eq dsp --notrain --EbNodB 4 --phase_offset

MSE loss function:

python3 ml_eq.py --EbNodB 4 --phase_offset --lr 0.001 --epochs 100

Phase loss function:

python3 ml_eq.py --EbNodB 4 --phase_offset --lr 0.001 --epochs 100 --loss_phase
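
For context, illustrative forms of the two loss functions (assumed; ml_eq.py may define them differently), where y_hat are the equalised symbols and y the reference symbols:

    import torch

    def loss_mse(y_hat, y):
        # penalises the full complex error (magnitude and phase)
        return torch.mean(torch.abs(y_hat - y)**2)

    def loss_phase(y_hat, y):
        # penalises phase error only, ignoring magnitude
        phase_err = torch.angle(y_hat * torch.conj(y))
        return torch.mean(1 - torch.cos(phase_err))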

drowe67 commented Feb 3, 2025

Frame 2 EQ examples

  1. Ideal (perfect EQ)

    python3 ml_eq.py --frame 2 --notrain --eq bypass --EbNodB 4
    <snip>
    EbNodB:  4.00 n_bits: 240000 n_errors: 3027 BER: 0.013
    
  2. Classical DSP lin:

    python3 ml_eq.py --eq dsp --notrain --EbNodB 4 --phase_offset --frame 2
    <snip>
    EbNodB:  4.00 n_bits: 240000 n_errors: 3921 BER: 0.016
    
  3. ML EQ (using MSE loss function):

    python3 ml_eq.py --frame 2 --lr 0.1 --epochs 100 --EbNodB 4 --phase_offset --n_syms 1000000 --batch_size 128
    <snip>
    EbNodB:  4.00 n_bits: 24000000 n_errors: 437933 BER: 0.018
    

drowe67 commented Feb 4, 2025

ML waveform training

  1. Generate 10 hour complex h file:
    Fs=8000; Rs=50; Nc=20; multipath_samples('mpp', Fs, Rs, Nc, 10*60*60, 'h_nc20_train_mpp.c64',"",1);
    
  2. Training (channel application sketched after this list):
    python3 train.py --cuda-visible-devices 0 --sequence-length 400 --batch-size 512 --epochs 200 --lr 0.003 --lr-decay-factor 0.0001 ~/Downloads/tts_speech_16k_speexdsp.f32 250204_test --bottleneck 3 --h_file h_nc20_train_mpp.c64 --h_complex --range_EbNo --plot_loss --auxdata
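
For context, a minimal sketch of applying a pre-generated complex h file per carrier during training (illustrative file layout; train.py's actual channel code may differ):

    import numpy as np

    Nc = 20
    # one complex fading gain per carrier per symbol, from multipath_samples()
    h = np.fromfile('h_nc20_train_mpp.c64', dtype=np.complex64).reshape(-1, Nc)

    def apply_channel(tx_syms, start):
        # tx_syms: (n_syms, Nc) complex carrier symbols
        return tx_syms * h[start:start + tx_syms.shape[0], :]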
    

drowe67 commented Nov 11, 2025

Toolchain for JMV's adasmooth timing est with post proc

./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/all.wav /dev/null --rate_Fs --latent-dim 56 --peak --cp 0.004 --time_offset -16 --correct_time_offset -16 --auxdata --w1_dec 128 --write_rx 250725_rx_awgn.f32
./jmv_ft_tool.sh 250725_rx_awgn.f32 delta_hat.f32
./rx2.sh 250725/checkpoints/checkpoint_epoch_200.pth 251002_mpp_16k_ft 250725_ml_sync 250725_rx_awgn.f32 /dev/null --latent-dim 56 --w1_dec 128 --noframe_sync --read_delta_hat delta_hat.f32
python3 loss.py features_in.f32 features_out_rx2.f32 --plot --clip_start 25

Note: --read_delta_hat delta_hat.f32 uses the external timing est, so 251002_mpp_16k_ft is not being used.

Testing FT est

Expected answer is Ncp=32; the Octave tool has an off-by-one error, so it reports 33:

./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/all.wav /dev/null --rate_Fs --latent-dim 56 --peak --cp 0.004 --time_offset -16 --correct_time_offset -16 --auxdata --w1_dec 128 --write_rx 250725_rx_awgn.f32
./jmv_ft_tool.sh 250725_rx_awgn.f32 delta_hat.f32 --no_bpf
octave:17> delta_hat=load_f32('delta_hat.f32',1)
<snip past initial transient>
33
33
33

Prototyping Signal Det using Rayleigh model

./jmv_ft.sh -5 5
Ry=load_c64('Ry.c64',160); [y,d]=adasmooth(Ry); figure(1); mesh(abs(y)); figure(2); hist(abs(y(:)),100);
T=0.5; e^(-(T^2)/var(y(:)))
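
For reference, the expression above is the Rayleigh tail probability: modelling a noise-only bin y as zero-mean complex Gaussian, |y| is Rayleigh distributed, so the false alarm probability for threshold T is

$$P(|y| > T) = e^{-T^2/\mathrm{E}[|y|^2]}$$

with var(y(:)) estimating E[|y|^2].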

Using sig_det and a single IIR filter:

./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/brian_g8sez.wav /dev/null --rate_Fs --latent-dim 56 --peak --cp 0.004 --time_offset -16 --correct_time_offset -16 --auxdata --w1_dec 128 --write_rx 250725_rx_awgn.f32 --prepend_noise 2
python3 autocorr_simple.py 250725_rx_awgn.f32 Ry.c64
Ry=load_c64('Ry.c64',160); [det,sigma_r,Ry_bar,Ts] = sig_det(Ry); figure(1); mesh(abs(Ry_bar)); figure(2); hist(abs(Ry_bar(:)),100); figure(3); plot(max(abs(Ry_bar'))); hold on; plot(Ts); hold off;
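
For context, my reading of autocorr_simple.py is the classic CP correlation: each cyclic prefix is correlated against its copy one symbol (M samples) later, at every candidate timing offset. A minimal sketch with assumed sizes (M=160, Ncp=32 at Fs=8000, Rs=50, matching the 160-column Ry.c64):

    import numpy as np

    M, Ncp = 160, 32  # symbol and CP lengths in samples (assumed)

    def cp_autocorr(rx, n_syms):
        # Ry[s, d]: CP correlation at candidate timing offset d in symbol s;
        # |Ry| peaks when d lands on the cyclic prefix
        Ry = np.zeros((n_syms, M), dtype=np.complex64)
        for s in range(n_syms):
            base = s * (M + Ncp)
            for d in range(M):
                a = rx[base + d : base + d + Ncp]
                b = rx[base + d + M : base + d + M + Ncp]
                Ry[s, d] = np.sum(a * np.conj(b))
        return Ry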

Integrating rx2.py

./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/brian_g8sez.wav /dev/null --rate_Fs --latent-dim 56 --peak --cp 0.004 --time_offset -16 --correct_time_offset -16 --auxdata --w1_dec 128 --write_rx 250725_rx_awgn.f32 --prepend_noise 2 --append_noise 2 --freq_offset 5
./rx2.sh 250725/checkpoints/checkpoint_epoch_200.pth 250725_ml_sync 250725_rx_awgn.f32 /dev/null --latent-dim 56 --w1_dec 128 --noframe_sync --write_delta_hat delta_hat.int16 --write_delta_hat_pp delta_hat_pp.int16 --write_sig_det sig_det.int16
octave:152> delta_hat=load_raw('delta_hat.int16'); figure(5); clf; plot(delta_hat); delta_hat_pp = load_raw('delta_hat_pp.int16'); hold on; plot(delta_hat_pp); sig_det=load_raw('sig_det.int16'); plot(sig_det*175); hold off; freq_offset=load_f32('freq_offset.f32',1); figure(6); plot(freq_offset)

WIP streaming

Streams the odd/even frame sync and the output of z_hat; the decoder is still run in one hit.

 ./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/brian_g8sez.wav /dev/null --rate_Fs --latent-dim 56 --peak --cp 0.004 --time_offset -16 --correct_time_offset -16 --auxdata --w1_dec 128 --write_rx 250725_rx_awgn.f32 --prepend_noise 1 --append_noise 2
./rx2.sh 250725/checkpoints/checkpoint_epoch_200.pth 250725_ml_sync 250725_rx_awgn.f32 /dev/null --latent-dim 56 --w1_dec 128 --write_delta_hat delta_hat.int16 --write_delta_hat_pp delta_hat_pp.int16 --write_sig_det sig_det.int16 --write_state state.int16 --write_freq_offset_smooth freq_offset_smooth.f32 --write_frame_sync frame_sync.f32 --noframe_sync
python3 loss.py features_in.f32 features_out_rx2.f32 --plot --clip_start 25 --clip_end 30

Note --clip_end is needed, as extra frames from the post-pended noise upset loss.py alignment (I think).

Initial pass of "four point" manual tests

021d1fb

Loss from inference.py (genie timing and freq) compared to rx2.py (timing and freq estimators):

Channel  SNR (dB)  inference.py  rx2.py
AWGN     high      0.083         0.081
AWGN     -4.4      0.407         0.401
MPP      high      0.101         0.107
MPP      -1.4      0.324         0.346

"high" SNR means the default --EbNodB 100, so effectively noise free. The low SNRs are at roughly the minimum possible for speech.

The four spot test points are high/low SNR AWGN and high/low SNR MPP. Command line for the worst case, low SNR MPP at -1.4 dB SNR:

./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/all.wav /dev/null --rate_Fs --latent-dim 56 --peak --cp 0.004 --time_offset -16 --correct_time_offset -16 --auxdata --w1_dec 128 --write_rx 250725_rx.f32 --prepend_noise 1 --append_noise 2 --g_file g_mpp.f32 --EbNodB 5
./rx2.sh 250725/checkpoints/checkpoint_epoch_200.pth 250725_ml_sync 250725_rx.f32 /dev/null --latent-dim 56 --w1_dec 128 --write_delta_hat delta_hat.int16 --write_delta_hat_pp delta_hat_pp.int16 --write_sig_det sig_det.int16 --write_state state.int16 --write_freq_offset_smooth freq_offset_smooth.f32 --write_frame_sync frame_sync.f32 --hangover 100
python3 loss.py features_in.f32 features_out.f32 --features_hat2 features_out_rx2.f32 --clip_start 50 --clip_end 300
Loss between features_in.f32 and features_out.f32
  loss: 0.324 start: 50 acq_time:  0.50 s
Loss between features_in.f32 and features_out_rx2.f32
  loss: 0.346 start: 106 acq_time:  1.06 s

Notes:

  1. We manually extend the state machine hangover (--hangover 100) to 100 symbols (2 seconds) so that we stay in sync. A re-sync causes a break in the sequence of output feature vectors, which breaks the loss.py measurement. In practice a user may not notice anything, as the re-sync would occur during a deep fade. It may be OK to use a large --hangover for command line testing but a smaller hangover in the real world, given the small frame size and the low cost of a re-sync compared to RADE V1.
  2. Likewise the --clip_end 300: this cuts off the garbage features at the end of features_out_rx2.f32 that tend to upset the way loss.py syncs the two feature vector sequences. If you increase --hangover, it's a good idea to also increase --clip_end. Note that there are 4 feature vectors for every two OFDM symbols.
  3. The priority is getting loss measurements from a real world acquisition system/state machine that are similar to those from the bare bones ML decoder with genie timing and freq estimates. We haven't optimised acquisition time, and there is no end-of-over detection (yet), so we will have significant "run on" when transmission stops.
