// Copyright (c) 2019, The Emergent Authors. All rights reserved.
// Use of this source code is governed by a BSD-style
// license that can be found in the LICENSE file.

/*
Package auditory is the top-level repository for auditory processing code in Go (golang), focused on filtering speech wav files with mel filters. A further step applies Gabor filters to produce input for neural networks. The processing code is split into four packages, sound, mel, dft and agabor, which can be used independently. A fifth package, trm, is a work-in-progress port of Gnuspeech. Example code is in examples/processspeech.

The 'sound' package loads a wav file into a buffer and converts it to a floating-point tensor. It also provides functions for trimming and padding.

The 'dft' package performs a discrete Fourier transform on the sound samples passed in and computes their power spectrum.

The 'mel' package creates a bank of mel filters and applies them to the power spectrum to create a spectrogram.

The 'agabor' package produces an edge detector that detects oriented contrast transitions between light and dark, which can be convolved with the output of the mel processing.
*/
package auditory