SpecAugment Layer added #93

Toku11 · 2020-09-03T17:56:22Z

I didn't add white noise layer because it has been removed

…to kapre-0.3.3

keunwoochoi

Thanks for the initiative. Because the target branch already had keras.augmentation, there's a merge conflict but it wouldn't be anything too annoying.

kapre/augmentation.py

kapre/backend.py

Toku11 · 2020-09-03T20:41:14Z

Ok, let me know if you need something else, I created a branch from master and there was no augmentation :D I didn't see 0.3.3 branch hh

keunwoochoi

Thanks for the following up. I requested some changes. I acknowledge that this could be not a very little amount of work, but please understand that I have to be responsible for the consistency of the package.
Besides the comments, some unit tests for the expected behaviors of the function and layer + save/load test of the layer is a mandatory, as done for other layers.. so that we can trust Kapre!

keunwoochoi · 2020-09-04T17:12:00Z

kapre/backend.py

-    """Apply masking to a spectrogram in the freq domain.
-    TensorFlow/io
-
+def random_masking_along_axis(input_f, param, axis, name='masking'):


as in the example i prototyped (

def random_masking_along_axis(x, axis, x_min=0, x_max=None, max_width=None): if x_max is None: x_max = x.shape[axis] if max_width is None: max_width = (x_max - x_min) // 4 # 4 is an arbitrary choice though # do the work.. ```) I think this `param` should be unpacked and listed in this API.

keunwoochoi · 2020-09-04T17:13:03Z

kapre/backend.py

-
+def random_masking_along_axis(input_f, param, axis, name='masking'):
+    """
+    Apply masking to a spectrogram in the time/freq domain.
    Args:
      input: An audio spectogram.


name mistmatch. also, the format would be..

input (`Tensor`): Audio input spectrogram

keunwoochoi · 2020-09-04T17:14:20Z

kapre/backend.py

    Returns:
      A tensor of spectrogram.
    """
    # TODO: Support audio with channel > 1.
-    freq_max = tf.shape(input_f)[1]
+    _max = tf.shape(input_f)[axis + 1]
+    _shape = [-1, 1, 1, 1]


I don't think we need to reshape the input tensor. We can instead keep the code cleaner/shorter by just passing the right axis argument in the layer implementation. At the same time, this function would just blindly do masking over the specified axis.

keunwoochoi · 2020-09-04T17:18:20Z

kapre/backend.py

    )
-    indices = tf.reshape(tf.range(freq_max), (-1,freq_max,1,1))
+    indices = tf.reshape(tf.range(_max), tuple(_shape))


now because we're taking an arbitrary shaped input tensor, the reshaping should be taken care well. and that should be verified with unittest.

keunwoochoi · 2020-09-04T17:25:14Z

kapre/augmentation.py

@@ -40,10 +45,9 @@ def __init__(

        self.freq_param = freq_param


if the backend API changes, the API of this layer should follow. Maybe it should be like

def __init__(self, time_min=0, time_max=None, time_max_width=None, freq_min=0, freq_max=None, freq_max_width=None, data_format='default', **kwargs):

where if time_min is None, it doesn't mask over time (and same for frequency axis).
Please note that all Kapre layers are compatible with both channels_first and channels_last data format, so this one should be, too.

bagustris · 2021-08-16T09:33:27Z

Any progress on this PR? Having SpecAugment as Keras layer would be very useful for the DL-based audio processing community...

Toku11 · 2021-08-16T13:28:54Z

Any progress on this PR? Having SpecAugment as Keras layer would be very useful for the DL-based audio processing community...

Hi bagustris, unfortunately I was working in a project at that moment and I didn't have time to do proper test and documentation as per requested, however there is a functional version, I would really appreciate if you can continue with the work and make a pull request in the repository, please check my code https://github.com/Toku11/kapre/blob/d64e19d4917a5c7f8f109b2cfe5b7e06d118b8e2/kapre/augmentation.py#L21

MichaelisTrofficus · 2021-11-10T17:25:24Z

Hi, it's a pity I hadn't seen this pull request before. A couple of days ago I uploaded a package to Pypi where I do just this, that is, a custom layer of tensorflow.keras that implements the SpecAugment technique.

This is the repo if you want to take a look: https://github.com/MichaelisTrofficus/spec_augment

If you see that it can be useful I can adapt it to kapre and make a new pull request with proper testing and documentation or simply continue this one but with my own implementation.

keunwoochoi and others added 4 commits September 2, 2020 13:23

draft for augmentation

f6e6d94

draft for augmentation

3cf5980

Merge branch 'kapre-0.3.3' of https://github.com/keunwoochoi/kapre in…

e55c20e

…to kapre-0.3.3

SpecAugment Layer added

0b83a83

keunwoochoi changed the base branch from master to kapre-0.3.3 September 3, 2020 18:02

keunwoochoi requested changes Sep 3, 2020

View reviewed changes

Time and Freq masking merged, Docstring fixed, if instead of assert

de42bcd

Merge branch 'kapre-0.3.3' into noise

2e0d4c0

keunwoochoi requested changes Sep 4, 2020

View reviewed changes

keunwoochoi force-pushed the kapre-0.3.3 branch 2 times, most recently from 5f3f956 to 6f45b89 Compare September 15, 2020 02:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SpecAugment Layer added #93

SpecAugment Layer added #93

Toku11 commented Sep 3, 2020

keunwoochoi left a comment

Toku11 commented Sep 3, 2020 •

edited

Loading

keunwoochoi left a comment

keunwoochoi Sep 4, 2020

keunwoochoi Sep 4, 2020

keunwoochoi Sep 4, 2020

keunwoochoi Sep 4, 2020

keunwoochoi Sep 4, 2020

bagustris commented Aug 16, 2021

Toku11 commented Aug 16, 2021 •

edited

Loading

MichaelisTrofficus commented Nov 10, 2021 •

edited

Loading

		@@ -40,10 +45,9 @@ def __init__(

		self.freq_param = freq_param

SpecAugment Layer added #93

Are you sure you want to change the base?

SpecAugment Layer added #93

Conversation

Toku11 commented Sep 3, 2020

keunwoochoi left a comment

Choose a reason for hiding this comment

Toku11 commented Sep 3, 2020 • edited Loading

keunwoochoi left a comment

Choose a reason for hiding this comment

keunwoochoi Sep 4, 2020

Choose a reason for hiding this comment

keunwoochoi Sep 4, 2020

Choose a reason for hiding this comment

keunwoochoi Sep 4, 2020

Choose a reason for hiding this comment

keunwoochoi Sep 4, 2020

Choose a reason for hiding this comment

keunwoochoi Sep 4, 2020

Choose a reason for hiding this comment

bagustris commented Aug 16, 2021

Toku11 commented Aug 16, 2021 • edited Loading

MichaelisTrofficus commented Nov 10, 2021 • edited Loading

Toku11 commented Sep 3, 2020 •

edited

Loading

Toku11 commented Aug 16, 2021 •

edited

Loading

MichaelisTrofficus commented Nov 10, 2021 •

edited

Loading