AnnaShaleva/Echo-of-Moscow-scrapper

Echo-of-Moscow scrapper

The aim of this project is to prepare a data corpus for neural network training. The corpus consists of (audio, transcript) pairs collected from https://echo.msk.ru/

Description

This repository contains two files:

  • urls.txt contains the list of available URLs to get texts from
  • extract_data_1.py contains functions for fetching and parsing texts and audio from these URLs using BeautifulSoup
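
As a rough illustration of the workflow described above, here is a minimal sketch of reading a URL list and pulling an (audio, transcript) pair out of one page. It is not the code from extract_data_1.py: the names read_urls, PairParser and extract_pair are hypothetical, the page layout (transcript in `<p>` tags, audio as a link ending in .mp3) is an assumption, and the standard-library html.parser module is used as a stand-in for BeautifulSoup so that the example has no third-party dependencies.

```python
from html.parser import HTMLParser
from pathlib import Path


def read_urls(path):
    """Return the non-empty, stripped lines of a URL list such as urls.txt."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]


class PairParser(HTMLParser):
    """Collect paragraph text and the first .mp3 link from one HTML page.

    Assumption: the transcript lives in <p> tags and the audio is linked
    through an <a href="...mp3">. The real site layout may differ.
    """

    def __init__(self):
        super().__init__()
        self.audio_url = None
        self.paragraphs = []
        self._in_p = False
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            # Start buffering the text of a new paragraph.
            self._in_p = True
            self._buf = []
        elif tag == "a" and self.audio_url is None:
            href = dict(attrs).get("href") or ""
            if href.lower().endswith(".mp3"):
                self.audio_url = href

    def handle_endtag(self, tag):
        if tag == "p" and self._in_p:
            # Collapse runs of whitespace and skip empty paragraphs.
            text = " ".join("".join(self._buf).split())
            if text:
                self.paragraphs.append(text)
            self._in_p = False

    def handle_data(self, data):
        if self._in_p:
            self._buf.append(data)


def extract_pair(html):
    """Return (audio_url, transcript) for one page given as an HTML string."""
    parser = PairParser()
    parser.feed(html)
    parser.close()
    return parser.audio_url, "\n".join(parser.paragraphs)
```

In use, each URL returned by read_urls("urls.txt") would be downloaded (for example with urllib.request.urlopen) and the decoded page passed to extract_pair; the download step is left out here so the sketch runs without network access.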
