Case Study

A leading autonomous driving company (e.g. Tesla) requires a large image labeled dataset to improve its vehicle recognition algorithms. The task involves labeling various elements in driving scenes, such as cars, pedestrians, traffic signs, and road conditions.

Step-by-Step Workflow:

Step 1: Task Creation

Requester (e.g. Tesla) submits a task on the DataCrowd platform, with specific requirements including:

  • Data Type: Images from dashboard cameras

  • Labeling Type: Bounding box labeling of cars, pedestrians, and traffic signs

  • Quantity: 100,000 images

  • Quality Requirement: Labeling accuracy must reach 90%

Step 2: Task Assignment Algorithm

DataCrowd uses a task assignment algorithm to distribute the task to contributors based on their skills, past performance, and engagement level.

Sample Algorithms:

  • Skill Matching Algorithm: The algorithm evaluates contributors past performance on similar tasks and matches them to the current task based on their skill level.

  • Dynamic Task Assignment: Contributors are ranked based on previous task completion, quality of past labeling tasks (accuracy score) and engagement level (frequency of contributions)

Task Assignment Example: Identifying Contributors A, B, and C as the best candidates:

  • Contributor A: 95% accuracy, high engagement

  • Contributor B: 85% accuracy, moderate engagement

  • Contributor C: 90% accuracy, low engagement

Result: The algorithm assigns 60% of the task to Contributor A, 30% to Contributor B, and 10% to Contributor C.

Step 3: Data Labeling

Contributors use a user-friendly interface to label images, allowing them to:

  • Draw bounding boxes around identified objects

  • Label each box with the relevant category (e.g., "car," "pedestrian," "traffic sign")

  • Provide additional metadata if necessary (e.g., weather conditions)

Step 4: Validation Algorithm

Once contributors submit their labels, the platform evaluates the quality of the work using a validation algorithm.

Validation Process:

  • Automatic Quality Check: The system uses algorithms to analyze the consistency and accuracy of the labeling samples. For example, comparing the labels with a pre-labeled data-set, or using object detection models to validate the position of the bounding boxes.

  • Peer Review System: A portion of tasks is sent to other contributors for peer review, ensuring an additional layer of quality control

Step 5: Reward Distribution

After the validation process, contributors receive rewards based on their performance metrics through the evaluation and reward system.

Last updated