- update: Packages for fixing vulnerabilities
- update: Reftime handling works if reftime is malformed
- add: validations for the input file for conversation tagging
- PL-61: Add retry mechanism for uploading data to Label studio
- PL-1300: new data format support & extra meta-annotation-columns (#21)
- fix: alternatives column postprocessing after downloading from labelstudio
- add: required paramter --data-label for uploading datasets
- add: modified downloading from db to integrate labelstudio db queries
- fix: data format issue after downloading from labelstudio
- fix: pass project_id for LS and job_id for TOG
- add: makes batches of batched_dataset while uploading to tog db - limits max async connections to db at a time
- add: retries with sleep while uploading data to tog db
- fix: bug(#13) passing of creds into creating Job as None
- fix: datetime formatting produces
None
if none of the formats match. - fix: connection timeout mitigated by batching the requests.
- update: data download uses start and end date for query only if provided.
- add: error handling for uploads.
- add: job-id to dataset.
- add: json cols are unpacked.
- feat: Add labelstudio integration. We can upload/download datasets to labelstudio.
- update: Dedupe id uses uuid4 at runtime.
- fix: Parse timezone from strings.
- add: Support for legacy dataframes with alternatives.
- update: gh actions script updates.
- update: Data upload is more tolerant or utterance json vs python object.
- update: JSON fields are stringified.
- update: describe and stat dataset requires database object instance optionally.
- update: deps for compatibility with skit-pipelines.
- update: Higher tolerance for dataset schema errors.
- fix: -j added back to upload dataset command.
- update: remove preprocessing over df.
- fix: bug preventing data uploads due to larger than supported values in source.
- update: dataset {stat, describe} can apply date ranges.
- update: CLI allows database params as input.
- update: obtain type of the dataset from
Job::type()
.
- refactor: no difference in usage.
- refactor: dependency on skit-fixdf removed.
- feat: Upload datasets via cli.
- feat: Access datasets from dvc.