CSV files download should be async #2350

manumoreira · 2024-07-02T19:07:14Z

In the last years CSV download has been generating several problems to the users. On medium to big surveys it ends up crashing the server frequently.
A solution for this problem can be uncoupling file creation from download.
This will imply a change in the user interaction.
Initially the user will click to create the file, the creation will be processed and once it is ready the user will be see a button to download it.
We will need mockups for this

(source)

Questions:

Will it be usefull to have an option to trigger the file creation for all the files of the survey?

matiasgarciaisaia · 2024-07-08T00:55:32Z

Let's note that the file which usually gives issues is the Interactions file (I don't remember having had issues with any other CSV file - but please correct me, @manumoreira).

But it may make a lot of sense to share the same approach between different files.

We could also generate the task (to create the files) and then notifiy the requester via email.

manumoreira · 2024-07-08T12:27:18Z

In some cases we've seen issues with the results file in large surveys.
I'd prefer not to add an email service for this, just to keep it simple.
A circular progress bar might be enough.

* Preload channels from survey.respondent_groups * fix respondent_controller_tests * preload all respondent_groups channels in the same query

Respondent files are usually large (Interactions files can grow up to 1M rows), and the "low" limit in queries made the DB work much more than needed (we've observed 99% CPU usage in the mysqld process when generating a 1M-rows interactions file with 1000 rows per query). Increasing this limit makes the app generate less queries to the DB, effectively driving the CPU usage down to about 30% instead. There's probably more room for improvement (the generation of the file is still CPU-bound instead of network-bound), but that's on the app itself - we should profile the app's code to further improve the performance. See #2350 See #2359 Co-authored-by: Gustavo Giráldez <[email protected]>

See #2350

We should un-skip them by the end of the PR. See #2350

The info about the generated files is still pending. See #2350

See #2350

This will allow us to check if there already is a file generated or not. Also, we move the decision of whether to regenerate a file or not to the user (instead of checking if we should generate the file again or not). See #2350

From the UI, request that the CSV files are generated by the backend. We still miss checking if the files exist or are currently being generated. See #2350

See #2350

Small changes, nothing too relevant. See #2350

There's probably still an issue with react-timeago not being properly ignored yet. See #2350

There are no definitions available. See #2350

Thanks, eslint! See #2350

anaPerezGhiglia mentioned this issue Jul 31, 2024

#2350: improve query for building interactions file #2359

Merged

matiasgarciaisaia pushed a commit that referenced this issue Aug 1, 2024

#2350: improve query for building interactions file (#2359)

abd0940

* Preload channels from survey.respondent_groups * fix respondent_controller_tests * preload all respondent_groups channels in the same query

matiasgarciaisaia mentioned this issue Aug 1, 2024

Increase DB chunk size for respondent files #2360

Merged

anaPerezGhiglia mentioned this issue Aug 5, 2024

#2350: Generate survey files async #2362

Draft

matiasgarciaisaia added a commit that referenced this issue Aug 13, 2024

Scaffold Survey Results tests

c9ee306

See #2350

matiasgarciaisaia added a commit that referenced this issue Aug 15, 2024

Scaffold Survey Results tests

4027e03

See #2350

matiasgarciaisaia added a commit that referenced this issue Aug 15, 2024

Keep moving tests

0e900c4

See #2350

matiasgarciaisaia added a commit that referenced this issue Aug 15, 2024

Make test suite "pass" by skipping broken tests

02a9384

We should un-skip them by the end of the PR. See #2350

matiasgarciaisaia added a commit that referenced this issue Sep 24, 2024

File downloads UI - first take

ed988f8

The info about the generated files is still pending. See #2350

matiasgarciaisaia added a commit that referenced this issue Sep 24, 2024

Stub downloading previously generated result files

607b382

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Typo

b72c41a

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Trigger files generation

45976ae

From the UI, request that the CSV files are generated by the backend. We still miss checking if the files exist or are currently being generated. See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Statically serve the generated files

f0567f4

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Add file status endpoint

c20bf93

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Make API return a map of files instead of an array

a2fa8a1

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Fetch files status upon Downloads dialog load

3137cba

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 22, 2024

Fetch files status upon download dialog

ceb100d

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 29, 2024

Enable/disable file buttons depending on state

31169b7

See #2350

matiasgarciaisaia added a commit that referenced this issue Oct 31, 2024

Regularly fetch respondent files status

a5dda9b

See #2350

matiasgarciaisaia added a commit that referenced this issue Nov 5, 2024

Fix SurveyResults tests

59e67b7

Small changes, nothing too relevant. See #2350

matiasgarciaisaia added a commit that referenced this issue Nov 5, 2024

Fixing FlowJS typing

20d0720

There's probably still an issue with react-timeago not being properly ignored yet. See #2350

matiasgarciaisaia added a commit that referenced this issue Nov 6, 2024

Ignore Flow-typing react-timeago

05db608

There are no definitions available. See #2350

matiasgarciaisaia added a commit that referenced this issue Nov 6, 2024

Remove unused variables

b17c050

Thanks, eslint! See #2350

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CSV files download should be async #2350

CSV files download should be async #2350

manumoreira commented Jul 2, 2024 •

edited by matiasgarciaisaia

Loading

matiasgarciaisaia commented Jul 8, 2024

manumoreira commented Jul 8, 2024

CSV files download should be async #2350

CSV files download should be async #2350

Comments

manumoreira commented Jul 2, 2024 • edited by matiasgarciaisaia Loading

matiasgarciaisaia commented Jul 8, 2024

manumoreira commented Jul 8, 2024

manumoreira commented Jul 2, 2024 •

edited by matiasgarciaisaia

Loading