
Data labeling pipeline
Upwork
Remoto
•11 hours ago
•No application
About
1. Data Capture & Ingestion • Understand existing data collection script 2. Preprocessing & Storage • Normalize formats (e.g., video codecs, messages types, sensor logs). • Store raw + processed data in structured cloud buckets. 3. Pipeline Integration • Convert ingested data into Label Studio–compatible formats (e.g., JSON, video frames, time-series). • Automate upload to Label Studio API. and create new annotations tasks as needed • Modify labels to match our requirements 4. Monitoring & QA • Logging & error handling for each pipeline stage. • Sample validation of data arriving in Label Studio. ⸻ Deliverables • Working ingestion service (scripts/services). • Preprocessing modules for each data type. • Automated uploader to Label Studio. • Documentation (deployment & usage guide).