-
Notifications
You must be signed in to change notification settings - Fork 855
Pull requests: kubeflow/trainer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(runtimes): propagate Trainer.NumNodes into TemplateSpec (Parallelism/Completions)
kind/bug
size/L
#3057
opened Dec 24, 2025 by
NarayanaSabari
Loading…
fix(manifests): fix Prometheus metrics port mismatch
size/S
#3056
opened Dec 24, 2025 by
ChughShilpa
Loading…
1 task
chore(deps): bump tonic from 0.12.3 to 0.14.2 in /pkg/data_cache/test
approved
dependencies
Pull requests that update a dependency file
rust
Pull requests that update rust code
size/L
#3054
opened Dec 22, 2025 by
dependabot
bot
Loading…
feat: support for arm based trainer image using arm runner
size/L
#3046
opened Dec 21, 2025 by
jaiakash
Loading…
1 task done
fix: add
appVersion field to Helm chart for Kubeflow Trainer
size/S
#3044
opened Dec 18, 2025 by
milinddethe15
Loading…
1 task
fix(operator): fix TrainJob suspend/resume webhook error (#3008)
ok-to-test
size/L
#3041
opened Dec 16, 2025 by
JEETDESAI25
Loading…
chore(deps): bump iceberg-datafusion from 0.5.1 to 0.6.0 in /pkg/data_cache
dependencies
Pull requests that update a dependency file
rust
Pull requests that update rust code
size/XXL
#3034
opened Dec 8, 2025 by
dependabot
bot
Loading…
chore(deps): update huggingface-hub requirement from <1.2,>=0.27.0 to >=0.27.0,<1.3 in /cmd/initializers/dataset
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
size/XS
#3032
opened Dec 8, 2025 by
dependabot
bot
Loading…
chore(deps): update huggingface-hub requirement from <1.2,>=0.27.0 to >=0.27.0,<1.3 in /cmd/initializers/model
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
size/XS
#3030
opened Dec 8, 2025 by
dependabot
bot
Loading…
feat: Add the manager field to the podTemplateOverride object
ok-to-test
size/L
#3020
opened Dec 4, 2025 by
kaisoz
Loading…
1 task
feat(runtimes): add Pending and Running status conditions for TrainJob
do-not-merge/hold
size/L
#3019
opened Dec 4, 2025 by
RohitYandigeri
Loading…
fix(operator): Prevent JobSet recreation when its TTL has expired
size/S
#3013
opened Nov 27, 2025 by
astefanutti
Loading…
1 task done
chore(deps): bump arrow from 55.1.0 to 57.1.0 in /pkg/data_cache
dependencies
Pull requests that update a dependency file
rust
Pull requests that update rust code
size/L
#3005
opened Nov 25, 2025 by
dependabot
bot
Loading…
chore(deps): bump torch from 2.7.1 to 2.9.1 in /cmd/runtimes/deepspeed
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
size/XS
#2987
opened Nov 17, 2025 by
dependabot
bot
Loading…
fix: fix resourcePerNode override not applied with Volcano scheduler
size/L
#2982
opened Nov 17, 2025 by
sksingh2005
Loading…
1 task done
chore(deps): bump iceberg from 0.5.1 to 0.6.0 in /pkg/data_cache
dependencies
Pull requests that update a dependency file
rust
Pull requests that update rust code
size/L
#2968
opened Nov 9, 2025 by
dependabot
bot
Loading…
chore(deps): bump bincode from 1.3.3 to 2.0.1 in /pkg/data_cache
dependencies
Pull requests that update a dependency file
rust
Pull requests that update rust code
size/S
#2967
opened Nov 9, 2025 by
dependabot
bot
Loading…
chore(deps): bump tonic from 0.12.3 to 0.14.2 in /pkg/data_cache
dependencies
Pull requests that update a dependency file
rust
Pull requests that update rust code
size/M
#2964
opened Nov 9, 2025 by
dependabot
bot
Loading…
feat(examples): Add kubectl-friendly YAML examples for TrainJob and TrainingRuntime
size/XXL
#2925
opened Nov 6, 2025 by
NarayanaSabari
Loading…
feat: KEP 2841 Flux Policy to support Flux Framework
lgtm
ok-to-test
size/L
#2909
opened Oct 31, 2025 by
vsoch
Loading…
1 task done
feat(docs): KEP-2779: Track TrainJob progress and expose training metrics
ok-to-test
size/XL
#2905
opened Oct 28, 2025 by
robert-bell
Loading…
chore: Add comprehensive unit tests for Config API
size/XXL
#2893
opened Oct 16, 2025 by
kapil27
Loading…
1 task
chore: Add Speech Recognition with DDP Example
ok-to-test
ok-to-test-gpu-runner
size/XXL
#2830
opened Sep 15, 2025 by
zren11
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.