Skip to content

Pull requests: datajuicer/data-juicer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] Feat: Add RayImageBTSMinhashDeduplicator
#897 opened Jan 29, 2026 by Dludora Loading…
[WIP] Add Camera Pose op dj:multimodal issues/PRs about multimodal data processing dj:op issues/PRs about some specific OPs enhancement New feature or request
#894 opened Jan 27, 2026 by Qirui-jiao Loading…
Add Hand Reconstruction op (HaWoR) dj:multimodal issues/PRs about multimodal data processing dj:op issues/PRs about some specific OPs enhancement New feature or request
#893 opened Jan 27, 2026 by Qirui-jiao Loading…
[feature] op-level isolated environment spec in ray mode dj:dist issues/PRs about distributed data processing dj:op issues/PRs about some specific OPs enhancement New feature or request environment related to third-party dependency, DJ-pypi, DJ-docker, etc.
#892 opened Jan 23, 2026 by HYLcool Loading…
[WIP] feat: Support iceberg、hudi、delta、hdfs data source. dj:core issues/PRs about the core functions of Data-Juicer dj:dataset issues/PRs about the dj-dataset
#875 opened Jan 6, 2026 by Dludora Loading…
Depth seg new op dj:op issues/PRs about some specific OPs
#862 opened Dec 22, 2025 by archernsy Loading…
Add Operator-Level Parallel Data Processing with Ray Actors dj:dist issues/PRs about distributed data processing dj:efficiency regarding to efficiency issues and enhancements enhancement New feature or request
#761 opened Aug 19, 2025 by Cccccc0630 Loading…
[NewOp] Add group_diversity_filter op
#745 opened Jul 22, 2025 by lingzhq Loading…
Add lidar object segmentation op
#736 opened Jul 14, 2025 by Qirui-jiao Loading…
[WIP] add lidar object detection op
#721 opened Jun 26, 2025 by Cathy0908 Loading…
[WIP] Optimization framework dj:core issues/PRs about the core functions of Data-Juicer dj:efficiency regarding to efficiency issues and enhancements
#702 opened Jun 13, 2025 by cyruszhang Loading…
[WIP] deduping benchmark suite
#607 opened Mar 4, 2025 by cyruszhang Loading…
Add humanvbench operators dj:multimodal issues/PRs about multimodal data processing dj:op issues/PRs about some specific OPs good first issue Good for newcomers
#553 opened Jan 17, 2025 by SYSUzhouting Loading…
ProTip! Filter pull requests by the default branch with base:main.