cover

Using Language Models to Simulate Human Samples: Appendix

11 Jun 2024

Explore the impact and challenges surrounding the adoption of datasheets for datasets.

cover

Using Language Models to Simulate Human Samples: Acknowledgments and References

11 Jun 2024

Explore the impact and challenges surrounding the adoption of datasheets for datasets.

cover

Datasheets for Datasets: Impact and Adoption Across Academic and Industry Sectors

11 Jun 2024

Explore the impact and challenges surrounding the adoption of datasheets for datasets.

cover

Ensuring Dataset Health: Strategies for Effective Maintenance and Support

11 Jun 2024

Learn about strategies for maintaining and supporting datasets, including erratum management, updates, and data retention policies

cover

Guidelines for Sharing AI Datasets Responsibly

11 Jun 2024

Learn how to navigate intellectual property issues and ensure responsible data sharing.

cover

Applications of ML Model Datasets

10 Jun 2024

Explore the wide array of tasks for which datasets can be utilized, along with potential risks and ethical considerations.

cover

From Raw to Refined: Understanding Preprocessing, Cleaning, and Labeling in Data Preparation

10 Jun 2024

Learn about techniques like tokenization, part-of-speech tagging, and feature extraction, ensuring your dataset is optimized for various tasks.

cover

Data Collection for ML Models: Strategies and Protocols for Ensuring Dataset Integrity

10 Jun 2024

Explore the intricate process of data collection: from acquisition methods to ethical considerations.

cover

Understanding Dataset Instances and Relationships

10 Jun 2024

Explore the composition of datasets, including instances, labels, data splits, errors, external resources, and considerations for confidentiality & sensitivity