You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We conducted a few superset users tests #3855 and compiled the feedback in this google doc. We agreed Superset mostly meets our requirements and is an improvement on dataset we identified a few concerns:
Superset is not a super customizable tool. If we wanted to make UI or feature changes in the future we'd need to fork the repo and get familiar with the sprawling javascript codebase. We're generally worried we'll outgrow the features and design goals of Superset.
Permissions: Superset meets our core requirement is to allow people to easily discover PUDL tables and download subsets of the data using a UI. We were hoping to also allow users to create their own dashboards and charts given Superset is a full BI tool. However, we realized that all users can see all saved charts and dashboards. This isn't ideal from a privacy, reputation and UX perspective. We probably would need to disable dashboard and chart making.
Programmatically creating table dashboards is not well supported. I was able to use the Superset dashboard yaml files and the API to create a script that can programmatically create dashboards for all our tables. However, the process for changing the template dashboard and apply those changes to all the table dashboards is cumbersome. We'd have to adjust the template dashboard in superset, download the yaml config file, figure out which pieces of the yaml file need to be parameterized, delete all the old table dashboards and then rerun the script.
Superset doesn't provide many features for making the data more discoverable. However, we managed to include the data dictionary in the welcome dashboard.
Generally, it feels like we're forcing Superset to be an open data portal when it's really designed to be a great BI tool for internal data. Therefore were going to do another pass at tool research before moving forward. That being said, Superset without letting users create their own dashboards and charts is still a solid backup options for improving our data delivery.
Requirements
We started this project at the end June 2024. Since then, we've talked to a ton energy data practitioners which changed our original requirements for this tool. Here are our requirements for this current research:
Must haves
Functionalities
Searchable list of all available tables
Access to data that’s currently not in SQLite (current Parquet files)
Display of table and column level descriptions/metadata
Filtering data and downloading as a CSV
Table preview
Updates when we update our data
Allow catalyst to create visualizations and share them publicly
Metrics
Collect user emails + collect metrics on table usage (not necessarily connected)
User self-registration (vs. us adding people manually) + recording user emails
Infrastructure
<5s for query response when there are 15 concurrent users
Operable by multiple Catalyst team members
Manage the UI as code so we don’t have to deal with a UI and we can version control the changes
Operations
Ability to have stable costs (e.g., by limiting usage)
We aren’t on the hook for 24/7 on-call maintenance - i.e. we are OK with it going down
Nice to have
Users have the ability to make charts
Ability to customize / configure the UI - e.g., make a dashboard, add explanatory text, having some modularity
Correlating users and their usage
Ability to create limited public view
Users can export and/or visualizations
UI Requirements
These feel like sub-requirements but I think are important to include:
Can add and remove columns. During testing some users felt overwhelmed when scrolling through our tables with hundreds of columns.
Filter using a drop down menu with auto complete. During testing some users were trusted by the lack of autocomplete in filter fields.
Can filter on any column
Tools to explore
BI/portal tools
Metabase
Redash
CKAN
... (please add more here)
Python web tools
dash
reflex
NiceGUI
Taipy
... (please add more here)
Parquet front ends
Perspective
... (please add more here)
Process
Agree on the requirements listed above
Agree on a list of tools to explore
Synchronously evaluate the tools on the requirements
The text was updated successfully, but these errors were encountered:
Background
We conducted a few superset users tests #3855 and compiled the feedback in this google doc. We agreed Superset mostly meets our requirements and is an improvement on dataset we identified a few concerns:
Generally, it feels like we're forcing Superset to be an open data portal when it's really designed to be a great BI tool for internal data. Therefore were going to do another pass at tool research before moving forward. That being said, Superset without letting users create their own dashboards and charts is still a solid backup options for improving our data delivery.
Requirements
We started this project at the end June 2024. Since then, we've talked to a ton energy data practitioners which changed our original requirements for this tool. Here are our requirements for this current research:
Must haves
Nice to have
UI Requirements
These feel like sub-requirements but I think are important to include:
Tools to explore
BI/portal tools
Python web tools
Parquet front ends
Process
The text was updated successfully, but these errors were encountered: