Skip to content

a Python package for unbiased estimation of timescales and hypothesis testing

License

Notifications You must be signed in to change notification settings

roxana-zeraati/abcTau

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

abcTau

abcTau is a Python package for unbiased estimation of timescales from autocorrelations or power spectrums using adaptive Approximate Bayesian Computations (aABC). This method overcomes the statistical bias in autocorrelations of finite data samples, which undermines the accuracy of conventional methods based on direct fitting of the autocorrelation with exponential decay functions. abcTau overcomes the bias by fitting the sample autocorrelation or power spectrum with a generative model based on a mixture of Ornstein-Uhlenbeck (OU) processes. This way it accounts for finite sample size and noise in data and returns a posterior distribution of timescales, that quantifies the uncertainty of estimates and can be used to compare alternative hypotheses about the dynamics of the underlying process. This method can be applied to any time-series data such as spike counts in neuroscience.

The details of the method are explained in:
Zeraati, R., Engel, T. A., & Levina, A. (2022). A flexible Bayesian framework for unbiased estimation of timescales. Nature Computational Science, 2(3), 193-204. https://doi.org/10.1038/s43588-022-00214-3
Please cite this paper (and if you can also the package DOI provided in the badge above) when you use this package for a scientific publication.

You can find Demos on how to use abcTau for estimating timescales and performing Bayesian model comparison in the Jupyter Notebook Tutorials 1 (fitting) and 2 (visualization and model comparison). These tutorials contain examples already used in figures 1, 3 and 5 of the paper above. The example data to test the package are available in example_data folder. The example outputs are available in example_abc_results and example_modelComparison. Three example python scripts are also available for running the package on a cluster with parallel processing. You can use the check_expEstimates function from the preprocessing module to check the bias in timescales estimated from exponential fits on your data.

Jupyter Notebooks that reproduce the paper figures are available in the paper_figures_notebooks folder.

For a recent application of this method see:
Zeraati, R., Shi, Y., Steinmetz, N. A., Gieselmann, M. A., Thiele, A., Moore, T., Levina, A. & Engel, T. A. (2023). Intrinsic timescales in the visual cortex change with selective attention and reflect spatial connectivity. Nature communications, 14(1), 1858. https://doi.org/10.1038/s41467-023-37613-7

The basis of aABC algorithm in this package is adapted from a previous implementation (originally developed in Python 2): Robert C. Morehead and Alex Hagen. A Python package for Approximate Bayesian Computation (2014). https://github.com/rcmorehead/simpleabc.

Operating system

  • macOS
  • Windows
  • Linux

Dependencies

  • Python >= 3.7.1
  • Numpy >= 1.15.4
  • Scipy >= 1.1.0

Installation

The abcTau package can be installed directly from pip:

pip install git+https://github.com/roxana-zeraati/abcTau.git

The typical time for installation (i.e. cloning the repository) is less than 10 sec.

Development

The object-oriented implementation of the package allows users to easily replace any function, including generative models, summary statistic computations, distance functions, etc., with their customized functions to better describe the statistics of their data. You can send us your customized generative models to be added directly to the package and create a larger database of generative models available for different applications (contact: [email protected])

List of available summary statistics (check summary_stats.py for details):

Ordered from fastest to slowest fitting:

  • comp_psd: computing the power spectral density
  • comp_ac_fft: computing autocorrelation using FFT to speed up (more biased for direct fitting)
  • comp_cc: computing autocorrelation in the time domain (less biased for direct fitting)

List of available generative models (check generative_models.py for details):

  • oneTauOU: one-timescale OU process
  • twoTauOU: two-timescale OU process
  • oneTauOU_oscil: one-timescale OU process with an additive oscillation
  • oneTauOU_twooscil: one-timescale OU process with two additive oscillations
  • oneTauOU_oneF: one-timescale OU process augmented with a 1/f background process (with variable exponent), in this function 1/f PSD can be replaced with any arbitrary PSD shape
  • oneTauOU_poissonSpikes: one-timescale inhomogeneous Poisson process
  • oneTauOU_gammaSpikes: one-timescale spiking process with gamma-distributed spike-counts and predefined dispersion
  • oneTauOU_gaussianSpikes: one-timescale spiking process with Gaussian-distributed spike-counts and predefined dispersion
  • oneTauOU_gammaSpikes_withDispersion: one-timescale spiking process with gamma-distributed spike-counts
  • oneTauOU_gaussianSpikes_withDispersion: one-timescale spiking process with Gaussian-distributed spike-counts
  • twoTauOU_poissonSpikes: two-timescale inhomogeneous Poisson process
  • twoTauOU_gammaSpikes: two-timescale spiking process with gamma-distributed spike-counts and predefined dispersion
  • twoTauOU_gaussianSpikes: two-timescale spiking process with Gaussian-distributed spike-counts and predefined dispersion
  • twoTauOU_gammaSpikes_withDispersion: two-timescale spiking process with gamma-distributed spike-counts
  • twoTauOU_gaussianSpikes_withDispersion: two-timescale spiking process with Gaussian-distributed spike-counts

List of available distance functions (check distance_functions.py for details):

  • linear_distance
  • logarithmic_distance