Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Publishing: Published datasets should contain files #10981

Open
sbarbosadataverse opened this issue Jul 19, 2024 · 4 comments · May be fixed by #10994
Open

Publishing: Published datasets should contain files #10981

sbarbosadataverse opened this issue Jul 19, 2024 · 4 comments · May be fixed by #10994
Labels
FY25 Sprint 9 FY25 Sprint 9 (2024-10-23 - 2024-11-06) Size: 30 A percentage of a sprint. 21 hours. (formerly size:33)

Comments

@sbarbosadataverse
Copy link

sbarbosadataverse commented Jul 19, 2024

The existing publishing model in HDV allows the publishing of datasets without files. This lends to the curation team contacting depositors about uploading files via support and often to deleting "empty" datasets.

To decrease the support contact and dataset deletions, change the publishing model to require all deposits contain at least one file before publishing can happen.

Related

  • Pending
@landreev
Copy link
Contributor

@sbarbosadataverse
I can fairly easily address this as another spam filter rule. So, an author would not be able to publish a dataset without files, but it would also result in an automatically opened RT ticket (just like with spam; although I could make it send a different email with a different subject).
But I'm realizing now that this may not be what you had in mind when you requested this (?).

@sbarbosadataverse
Copy link
Author

No ticket, maybe a message reading, "All deposits must contain data before publishing is allowed." @landreev
This is even better with "data" in it.

@cmbz
Copy link

cmbz commented Jul 24, 2024

2024/07/24

  • Sized at 33, depending on scope

@stevenwinship stevenwinship self-assigned this Oct 29, 2024
@cmbz cmbz transferred this issue from IQSS/dataverse.harvard.edu Oct 29, 2024
@cmbz cmbz added FY25 Sprint 9 FY25 Sprint 9 (2024-10-23 - 2024-11-06) Size: 30 A percentage of a sprint. 21 hours. (formerly size:33) labels Oct 29, 2024
@stevenwinship stevenwinship linked a pull request Oct 31, 2024 that will close this issue
@stevenwinship stevenwinship removed their assignment Oct 31, 2024
@jggautier
Copy link
Contributor

I've been working with a depositor who published datasets without files a while back and we've been working through some complications related to this. After seeing that @stevenwinship is working on this, this week I mentioned to the depositor that Harvard Dataverse are planning to require files before publishing datasets and I asked them for their thoughts on this. They wrote that they supported the idea 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
FY25 Sprint 9 FY25 Sprint 9 (2024-10-23 - 2024-11-06) Size: 30 A percentage of a sprint. 21 hours. (formerly size:33)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants