-
Notifications
You must be signed in to change notification settings - Fork 29k
Pull requests: apache/spark
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[SPARK-42025][SHUFFLE] Improve logs level in removeBlocks
CORE
#54043
opened Jan 28, 2026 by
varun-lakhyani
Loading…
[SPARK-55263][PYTHON][INFRA] Upgrade Python linter from 3.11 to 3.12 in CI
BUILD
INFRA
#54042
opened Jan 28, 2026 by
Yicong-Huang
Loading…
[SPARK-55262][Geo][SQL] Block Geo types in all file based data sources except Parquet
AVRO
SQL
#54038
opened Jan 28, 2026 by
uros-db
Loading…
[SPARK-55259][Geo][SQL] Implement Parquet schema conversion for Geo types
SQL
#54037
opened Jan 28, 2026 by
uros-db
Loading…
[MINOR] Raise exception if no active Spark context
PYTHON
SQL
#54036
opened Jan 28, 2026 by
amarvin
Loading…
[SPARK-55258][DOCS] Document CLI parameters in declarative pipelines programming guide
DOCS
#54035
opened Jan 28, 2026 by
sryza
Loading…
[SPARK-55256][SQL] Support IGNORE NULLS / RESPECT NULLS for array_agg and collect_list
SQL
#54034
opened Jan 28, 2026 by
yaooqinn
Loading…
[SPARK-54830][TESTS][FOLLOWUP] Disable shuffle checksum for the test case of SPARK-48037 to avoid memory issues
BUILD
SQL
#54033
opened Jan 28, 2026 by
ivoson
Loading…
[WIP][SPARK-54276][BUILD] Bump Hadoop 3.4.3 RC0
BUILD
CORE
DOCS
KUBERNETES
SQL
#54029
opened Jan 28, 2026 by
pan3793
Loading…
[SPARK-55249][PYTHON] Make DataFrame.toJSON able to return dataframe
CONNECT
PYTHON
SQL
#54025
opened Jan 28, 2026 by
zhengruifeng
Loading…
[SPARK-55038][SQL] Fix wrong results for array_agg(DISTINCT) with AQE…
SQL
#54021
opened Jan 28, 2026 by
anirudh83
Loading…
[SPARK-55246][SS] Add Test for Pyspark TWS and TWSInPandas and Fix StatePartitionAllColumnFamiliesWriter Bug
PYTHON
SQL
STRUCTURED STREAMING
#54019
opened Jan 28, 2026 by
zifeif2
Loading…
[SPARK-55225][PYTHON][PS] Restore to the original dtype for Datetime
PANDAS API ON SPARK
PYTHON
#54017
opened Jan 28, 2026 by
gaogaotiantian
Loading…
[SPARK-55243][CONNECT] Allow setting binary headers via the -bin suffix in the Scala Connect client
CONNECT
SQL
#54016
opened Jan 27, 2026 by
dillitz
Loading…
[SPARK-55244][PYTHON][PS] Use np.nan as default value for pandas string types
PANDAS API ON SPARK
PYTHON
#54015
opened Jan 27, 2026 by
gaogaotiantian
Loading…
[SPARK-55228][SPARK-55230][SQL][CONNECT] Implement Dataset.zipWithIndex in Scala API
BUILD
CONNECT
SQL
#54014
opened Jan 27, 2026 by
fangchenli
Loading…
Upgrade Spark 4.1 + Affirm Specific Change
BUILD
CORE
KUBERNETES
PYTHON
#54012
opened Jan 27, 2026 by
li-isabella
•
Draft
[SPARK-55031][SQL] Add vector avg/sum aggregation function expressions
SQL
#54011
opened Jan 27, 2026 by
zhidongqu-db
Loading…
[SPARK-46167][PS] Add axis implementation to DataFrame.rank
PANDAS API ON SPARK
PYTHON
#54009
opened Jan 27, 2026 by
devin-petersohn
Loading…
[SPARK-54887] Add previously removed legacy error class back in
CONNECT
SQL
#54008
opened Jan 27, 2026 by
garlandz-db
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.