Draft
Commits
10000 commits
45cf928
GH-47787: [C++][FlightRPC] ODBC `msi` Windows installer (#48054)
alinaliBQ Jan 8, 2026
fdeac0b
GH-48770: [CI] Add missing permissions declaration to workflows (#48771)
HyukjinKwon Jan 8, 2026
28a497f
GH-48752: [Ruby] Skip ChunkedArray test on Windows due to flakiness (…
HyukjinKwon Jan 8, 2026
0760523
GH-48461: [R][CI] Migrate Azure pipelines to GitHub actions (#48585)
raulcd Jan 8, 2026
00a97fe
GH-48776: [CI][Ruby][Windows] Ensure removing temporary files (#48777)
kou Jan 8, 2026
75f78a6
MINOR: [Docs] Typo in building.rst (#48768)
tadeja Jan 8, 2026
5b1af01
MINOR: [C++][Compute] Remove resolved TODO comments in case_when null…
HyukjinKwon Jan 8, 2026
fe5e0e5
GH-48780: [CI] Add missing permissions for reusable workflow calls (#…
HyukjinKwon Jan 8, 2026
4eccdad
GH-48741: [C++] Fix deadlock in CSV AsyncThreadedTableReader destruct…
pitrou Jan 8, 2026
ba2ccae
GH-48784: [GLib] Make (system) Parquet C++ is optional (#48785)
kou Jan 9, 2026
81e4046
MINOR: [Python] Remove impossible TODO in server middleware (#48794)
HyukjinKwon Jan 9, 2026
0c49503
GH-48755: [MATLAB] Rename getArrayProxyIDs to getProxyIDs (#48756)
HyukjinKwon Jan 9, 2026
ff3ca55
GH-48707: [C++][FlightRPC] Use IRD precision/scale defaults with ARD …
HyukjinKwon Jan 9, 2026
b126c6e
GH-47724: [C++][FlightRPC] ODBC: implement SQLDescribeCol (#48052)
alinaliBQ Jan 9, 2026
ddab347
GH-24157: [C++] Add tests for DayTimeIntervalBuilder (#48709)
HyukjinKwon Jan 9, 2026
2805afd
GH-48782: [Docs][CI] Skip Markdown files with doxygen and trigger Doc…
raulcd Jan 9, 2026
c43a2f0
MINOR: [Dev] Remove non active collaborators and add new ones (#48804)
raulcd Jan 9, 2026
2009cc9
GH-48807: [CI] Clean up space on GitHub runner to fix manylinux wheel…
raulcd Jan 9, 2026
6c7de4e
GH-48811: [C++][FlightRPC] ODBC: Add missing `arrow::` to fix build (…
alinaliBQ Jan 10, 2026
7fe39d4
MINOR: [R] Add the explicit mirror to install R dependencies (#48649)
HyukjinKwon Jan 10, 2026
a0f3d7b
MINOR: [R] Remove outdated TODO for integer division by zero tests (#…
HyukjinKwon Jan 10, 2026
3fb84ff
GH-48629: [R] Add tests for duplicate column names and incompatible t…
HyukjinKwon Jan 10, 2026
1e8645e
GH-48660: [R] Add tests for filter() and arrange() with aggregation e…
HyukjinKwon Jan 11, 2026
ec56b37
GH-48787: [C++] Disable `-Werror` for s2n-tls (#48791)
meher745 Jan 11, 2026
8995d61
GH-48668: [Python][Docs] Add python examples for compute functions `m…
zhengruifeng Jan 12, 2026
b90a2b8
GH-48314: [Python] Compat with pandas 3.0 changed default datetime un…
jorisvandenbossche Jan 12, 2026
8955bbf
GH-48631: [R] Non-API calls: 'ATTRIB', 'SET_ATTRIB' (#48634)
thisisnic Jan 12, 2026
342c5d5
GH-48809: [CI] Fix homebrew-cpp with Mac by using formula-based depen…
HyukjinKwon Jan 12, 2026
c08bcc6
GH-48827: [CI][Python] Add required xz dependency to emscripten docke…
raulcd Jan 12, 2026
985b16e
GH-48582: [CI][GPU][C++][Python] Add new CUDA jobs using the new self…
raulcd Jan 12, 2026
6657a92
GH-48799: [C++] Improve SharedExclusiveChecker error messages (#48800)
HyukjinKwon Jan 12, 2026
4e4026c
MINOR: [CI] Bump actions/upload-artifact from 4 to 6 (#48833)
dependabot[bot] Jan 13, 2026
cb100b2
GH-48594: [C++][FlightRPC] Fix ODBC CI Long Build Time Issue (#48595)
alinaliBQ Jan 13, 2026
2d92e3d
GH-48820: [Ruby] Add support for writing null array (#48821)
kou Jan 13, 2026
288cd43
GH-48623: [CI][Archery][Dev] Add missing headers to email reports (#4…
kou Jan 13, 2026
f82dede
GH-44817: [CI] Comment new repo url on issues of moved components (#4…
assignUser Jan 13, 2026
cff2c52
GH-40053: [Python] Preserve dict key order when inferring struct type…
gabrielsimoes Jan 13, 2026
4e5f7ea
GH-48838: [Release] Use gh cli to download sources for Linux packages…
raulcd Jan 13, 2026
4f77a06
GH-48841: [Release][Package] Add GH_TOKEN to rake build step on Linux…
raulcd Jan 13, 2026
d54a205
GH-48759: [Python] Remove skip condition for pandas/issues/50127 (#48…
HyukjinKwon Jan 13, 2026
fed23f3
GH-48844: [C++] Check IPC Message body length consistency in IPC file…
pitrou Jan 13, 2026
0ef674b
GH-48834 [C++][FlightRPC][Doc] Add instructions to run ODBC tests in …
alinaliBQ Jan 14, 2026
a1ec5a9
GH-47167: [C++][Dev] Update clang-format dependency (#47168)
AntoinePrv Jan 14, 2026
9866071
GH-48848: [Dev] Remove obsolete Java, Go, and Swift entries from .git…
HyukjinKwon Jan 14, 2026
962d051
GH-34785: [C++][Parquet] Add bloom filter write support (#37400)
mapleFU Jan 14, 2026
9a5ddb0
GH-48817 [R][C++] Bump C++20 in R build infrastructure (#48819)
jonkeane Jan 14, 2026
b14c6e0
GH-48856: [Release] Update copyright NOTICE year to 2026 (#48857)
raulcd Jan 15, 2026
d65eb94
MINOR: [C++] Remove TODO asking why null count set unknown (#48823)
HyukjinKwon Jan 15, 2026
fb0bac6
GH-48311: [C++] Fix OOB memory access in buffered IO (#48322)
chegoryu Jan 15, 2026
7820f67
GH-34785: [Doc][Parquet] Update doc for bloom filter support (#48860)
wgtmac Jan 15, 2026
ddcf2dc
MINOR: [CI][Release] Fix incorrect path in release_candidate.yml (#48…
amoeba Jan 16, 2026
25b7247
GH-48119: [C++][ODBC] Move class definitions to type_fwd.h (#48596)
AestheticAkhmad Jan 16, 2026
6ec9162
GH-48861: [CI] Fix wrong `smtplib.SMTP.send_message` usage (#48876)
kou Jan 16, 2026
435d8ba
GH-48664: [R] Implement support for keepNA = FALSE in base::nchar() (…
HyukjinKwon Jan 18, 2026
97aa9e4
MINOR: [Release] Update versions for 24.0.0-SNAPSHOT
raulcd Jan 18, 2026
06800ad
MINOR: [Release] Update .deb package names for 24.0.0
raulcd Jan 18, 2026
2ac912e
MINOR: [Release] Update .deb/.rpm changelogs for 23.0.0
raulcd Jan 18, 2026
34045db
GH-48560: [C++][Parquet] When fuzzing, treat Table validation error a…
pitrou Jan 19, 2026
e4c9ed2
GH-48884: [Dev][Release] Remove non-published draft release candidate…
raulcd Jan 19, 2026
8b648be
GH-48877: [C++][Parquet] Fix writer not to throw for bloom filter on …
wgtmac Jan 19, 2026
97c656b
GH-48888: [Ruby] Add support for writing boolean array (#48889)
kou Jan 19, 2026
7b99930
GH-48853: [Release] Fix bytes to string comparison in download_rc_bin…
shashbha14 Jan 20, 2026
894d6a3
GH-48872: [C++][FlightRPC][CI][Packaging] Upload ODBC to Nightly Rele…
alinaliBQ Jan 20, 2026
c183167
GH-48894: [Python][C++] Use base Azure::Core::RequestFailedException …
raulcd Jan 20, 2026
e78abb9
GH-48890: [CI][Packaging][APT] Remove needless packages in downgrade …
kou Jan 20, 2026
d6b4903
GH-48869: [Doc] Add runs-on and AWS to Continuous Integration Sponsor…
raulcd Jan 20, 2026
c5b3622
GH-48910: [Ruby] Add support for writing int8/uint8 arrays (#48911)
kou Jan 21, 2026
b5e495d
GH-48858: [C++][Parquet] Avoid re-serializing footer for signature ve…
pitrou Jan 21, 2026
421a475
GH-48900: [C++] Avoid memory blowup with excessive variadic buffer co…
pitrou Jan 21, 2026
ed35594
GH-48897: [C++] Benchmark and optimize CountSetBits (#48898)
pitrou Jan 21, 2026
3ed9169
GH-48864: [C++] Support customizing more Zstd parameters (#48865)
HuaHuaY Jan 21, 2026
02e37e2
GH-48721: [C++] Add test for file creation with UTF-8 filenames (#48722)
HyukjinKwon Jan 21, 2026
8e13dbc
GH-44926: [R] Remove usage of cpp11's cpp11/altrep.hpp and HAS_ALTREP…
thisisnic Jan 21, 2026
b29b786
GH-48916: [Ruby] Add support for writing binary array (#48917)
kou Jan 22, 2026
86bca25
GH-48921: [C++] Bump mimalloc to 3.2.7 (#48826)
pitrou Jan 22, 2026
f5b3eb5
GH-48132: [Ruby] Add support for writing int16/32/64 and uint16/32/64…
kou Jan 22, 2026
044ca4d
GH-48937: [Ruby] Add support for writing UTF-8 array (#48938)
kou Jan 22, 2026
9cde706
GH-48928: [R] Update NEWS for 23.0.0 release (#48930)
thisisnic Jan 22, 2026
9489a66
GH-48885: [C++] Add missing curl dependency of `Arrow::arrow_static` …
kou Jan 23, 2026
9f5a5c7
GH-48942: [Ruby] Add support for writing float32/64 arrays (#48944)
kou Jan 23, 2026
07c1c44
GH-48945: [Ruby] Add support for writing large binary array (#48946)
kou Jan 23, 2026
75ef031
GH-48394: [C++][Parquet] Add arrow::Result version of parquet::arrow:…
fenfeng9 Jan 23, 2026
7bc47c0
GH-48160: [C++][Gandiva] Pass CPU attributes to LLVM (#48161)
lriggs Jan 25, 2026
64ce4bd
GH-48973: [R][C++] Fix RE2 compilation errors under C++20 (#48976)
jonkeane Jan 25, 2026
e21d057
GH-48880: [Ruby] Fix a bug that Arrow::ExecutePlan nodes may be GC-ed…
kou Jan 26, 2026
1779d26
GH-48637: [C++][FlightRPC] ODBC: Disable `absl` deadlock detection (#…
alinaliBQ Jan 26, 2026
95a3ed4
GH-48965: [Python][C++] Compare unique_ptr for CFlightResult or CFlig…
raulcd Jan 26, 2026
8010794
GH-48924: [C++][CI] Fix pre-buffering issues in IPC file reader (#48925)
pitrou Jan 26, 2026
5272a68
GH-48966: [C++] Fix cookie duplication in the Flight SQL ODBC driver …
jmao-denver Jan 27, 2026
cb61dfe
GH-48691: [C++][Parquet] Write serializer may crash if the value buff…
boneanxs Jan 27, 2026
790ed2c
GH-48947 [CI][Python] Install pymanager.msi instead of pymanager.msix…
raulcd Jan 27, 2026
685873e
GH-48990: [Ruby] Add support for writing date arrays (#48991)
kou Jan 27, 2026
b85f20b
GH-48992: [Ruby] Add support for writing large UTF-8 array (#48993)
kou Jan 27, 2026
eb1525e
GH-48949: [C++][Parquet] Add Result versions for parquet::arrow::File…
fenfeng9 Jan 27, 2026
a7343ed
GH-48985: [GLib][Ruby] Fix GC problems in node options and expression…
kou Jan 27, 2026
515f724
GH-47692: [CI][Python] Do not fallback to return 404 if wheel is foun…
raulcd Jan 27, 2026
de06a52
GH-48912: [R] Configure C++20 in conda R on continuous benchmarking (…
thisisnic Jan 27, 2026
0d0e068
GH-36889: [C++][Python] Fix duplicate CSV header when first batch is …
rynewang Jan 27, 2026
5a71d2a
GH-48932: [C++][Packaging][FlightRPC] Fix `rsync` build error ODBC Ni…
alinaliBQ Jan 28, 2026
7540ad8
GH-48951: [Docs] Add documentation relating to AI tooling (#48952)
thisisnic Jan 28, 2026
939dd95
GH-49029: [Doc] Run sphinx-build in parallel (#49026)
pitrou Jan 28, 2026
5fef228
GH-33450: [C++] Remove GlobalForkSafeMutex (#49033)
pitrou Jan 28, 2026
4ea567c
GH-35437: [C++] Remove obsolete TODO about DictionaryArray const& ret…
HyukjinKwon Jan 28, 2026
ec827d6
GH-48586: [Python][CI] Upload artifact to python-sdist job (#49008)
raulcd Jan 28, 2026
d2fcaa6
MINOR: [R] Add 22.0.0.1 to compatiblity matrix (#49039)
thisisnic Jan 28, 2026
811a273
GH-48961: [Docs][Python] Doctest fails on pandas 3.0 (#48969)
tadeja Jan 28, 2026
0e9e32f
GH-49037: [Benchmarking] Install R from non-conda source for benchmar…
thisisnic Jan 28, 2026
1880d3a
GH-49042: [C++] Remove mimalloc patch (#49041)
pitrou Jan 28, 2026
debc30c
GH-49024: [CI] Update Debian version in `.env` (#49032)
pitrou Jan 28, 2026
5d3014a
GH-49027: [Ruby] Add support for writing time arrays (#49028)
kou Jan 28, 2026
f797801
GH-49030: [Ruby] Add support for writing fixed size binary array (#49…
kou Jan 28, 2026
08175e5
GH-48866: [C++][Gandiva] Truncate subseconds beyond milliseconds in `…
akravchukdremio Jan 29, 2026
3384596
GH-48673: [C++] Fix ToStringWithoutContextLines to check for :\d+ pat…
HyukjinKwon Jan 29, 2026
86b860f
GH-49044: [CI][Python] Fix test_download_tzdata_on_windows by adding …
rok Jan 29, 2026
222fac7
GH-48983: [Packaging][Python] Build wheel from sdist using build and …
raulcd Jan 29, 2026
3e6182a
GH-49059: [C++] Fix issues found by OSS-Fuzz in IPC reader (#49060)
pitrou Jan 29, 2026
aae5840
GH-49055: [Ruby] Add support for writing decimal128/256 arrays (#49056)
kou Jan 30, 2026
d510b10
GH-49053: [Ruby] Add support for writing timestamp array (#49054)
kou Jan 30, 2026
12cdb09
GH-28859: [Doc][Python] Use only code-block directive and set up doct…
AlenkaF Jan 30, 2026
8a77885
GH-49065: [C++] Remove unnecessary copies of shared_ptr in Type::BOOL…
HyukjinKwon Jan 30, 2026
e40efd8
GH-48159 [C++][Gandiva] Projector make is significantly slower after …
lriggs Jan 30, 2026
adef2ef
GH-49043: [C++][FS][Azure] Avoid bugs caused by empty first page(s) f…
Tom-Newton Jan 30, 2026
85c18a0
GH-49034 [C++][Gandiva] Fix binary_string to not trigger error for nu…
lriggs Jan 30, 2026
9106671
GH-48980: [C++] Use COMPILE_OPTIONS instead of deprecated COMPILE_FLA…
HyukjinKwon Jan 30, 2026
c6090ed
GH-49069: [C++] Share Trie instances across CSV value decoders (#49070)
HyukjinKwon Jan 30, 2026
5afdf0a
GH-49076: [CI] Update vcpkg baseline to newer version (#49062)
raulcd Jan 31, 2026
acb6288
GH-49074: [Ruby] Add support for writing interval arrays (#49075)
kou Jan 31, 2026
384ea25
GH-49071: [Ruby] Add support for writing list and large list arrays (…
kou Jan 31, 2026
235841d
GH-49087 [CI][Packaging][Gandiva] Add support for LLVM 15 or earlier …
lriggs Jan 31, 2026
15b7494
GH-49100: [Docs] Broken link to Swift page in implementations.rst (#4…
chilin0525 Jan 31, 2026
dbca958
GH-49096: [Ruby] Add support for writing struct array (#49097)
kou Feb 2, 2026
93c4e00
GH-49093: [Ruby] Add support for writing duration array (#49094)
kou Feb 2, 2026
3a1cb86
GH-49098: [Packaging][deb] Add missing libarrow-cuda-glib-doc (#49099)
kou Feb 2, 2026
699473f
GH-48764: [C++] Update xsimd (#48765)
AntoinePrv Feb 2, 2026
644ec57
GH-46008: [Python][Benchmarking] Remove unused asv benchmarking files…
raulcd Feb 2, 2026
012fd17
GH-49108: [Python] SparseCOOTensor.__repr__ missing f-string prefix (…
chilin0525 Feb 2, 2026
3b9d90c
GH-49083: [CI][Python] Remove dask-contrib/dask-expr from the nightly…
AlenkaF Feb 3, 2026
262e4e1
GH-49117: [Ruby] Add support for writing union arrays (#49118)
kou Feb 3, 2026
db349f1
GH-49119: [Ruby] Add support for writing map array (#49120)
kou Feb 3, 2026
7532327
GH-48922: [C++] Support Status-returning callables in Result::Map (#4…
abhishek593 Feb 3, 2026
c0d5a59
GH-49003: [C++] Don't consider `out_of_range` an error in float parsi…
Alvaro-Kothe Feb 3, 2026
7dacbd0
GH-48941: [C++] Generate proper UTF-8 strings in JSON test utilities …
HyukjinKwon Feb 3, 2026
bfc698e
GH-49067: [R] Disable GCS on macos (#49068)
jonkeane Feb 3, 2026
d2315fe
GH-49115: [CI][Packaging][Python] Update vcpkg baseline for our wheel…
raulcd Feb 4, 2026
961258d
GH-48954: [C++] Add test for null-type dictionary sorting and clarify…
HyukjinKwon Feb 4, 2026
2d447c4
GH-36193: [R] arm64 binaries for R (#48574)
thisisnic Feb 4, 2026
33f1ea5
GH-48397: [R] Update docs on how to get our libarrow builds (#48995)
thisisnic Feb 4, 2026
e37c516
GH-49104: [C++] Fix Segfault in SparseCSFIndex::Equals with mismatche…
AliRana30 Feb 4, 2026
f895512
MINOR: [Docs] Add links to AI-generated code guidance (#49131)
thisisnic Feb 4, 2026
f0de008
MINOR: [R] Add new vignette to pkgdown config (#49145)
thisisnic Feb 4, 2026
1cd1841
GH-49150: [Doc][CI][Python] Doctests failing on rst files due to pand…
rok Feb 5, 2026
ef5854a
GH-41990: [C++] Fix AzureFileSystem compilation on Windows (#48971)
nateprewitt Feb 5, 2026
6a2d09b
GH-49138: [Packaging][Python] Remove nightly cython install from many…
AlenkaF Feb 5, 2026
49423f8
GH-33459: [C++][Python] Support step >= 1 in list_slice kernel (#48769)
HyukjinKwon Feb 5, 2026
d31644a
GH-41863: [Python][Parquet] Support lz4_raw as a compression name ali…
nwoolmer Feb 5, 2026
f39f275
GH-48868: [Doc] Document security model for the Arrow formats (#48870)
pitrou Feb 5, 2026
baba1d4
GH-49004: [C++][FlightRPC] Run ODBC tests in workflow using `cpp_test…
alinaliBQ Feb 6, 2026
654fb28
GH-49092: [C++][FlightRPC][CI] Nightly Packaging: Add `dev-yyyy-mm-dd…
alinaliBQ Feb 6, 2026
0dfae70
GH-49156: [Python] Require GIL for string comparison (#49161)
raulcd Feb 6, 2026
d5fa7cb
GH-48575: [C++][FlightRPC] Standalone ODBC macOS CI (#48577)
alinaliBQ Feb 7, 2026
7fcc0af
GH-49164: [C++] Avoid invalid if() args in cmake when arrow is a subp…
zanmato1984 Feb 7, 2026
252a685
GH-48132: [Ruby] Add support for writing dictionary array (#49175)
kou Feb 8, 2026
a82edf9
GH-49081: [C++][Parquet] Correct variant's extension name (#49082)
HuaHuaY Feb 9, 2026
bb81a6e
GH-49102: [CI] Add type checking infrastructure and CI workflow for t…
rok Feb 9, 2026
02d05f4
GH-49190: [C++][CI] Fix `unknown job 'odbc' error` in C++ Extra Workf…
alinaliBQ Feb 9, 2026
649e1b6
MINOR: [CI] Bump docker/login-action from 3.6.0 to 3.7.0 (#49191)
dependabot[bot] Feb 9, 2026
f32f8ac
GH-48904: [C++][FlightRPC][CI][Packaging] Upload ODBC installer into …
alinaliBQ Feb 10, 2026
a6b45b6
GH-49081: [C++][Parquet][FOLLOWUP] Correct variant's extension name (…
HuaHuaY Feb 10, 2026
8b83784
GH-49159: [C++][Gandiva] Detect overflow in repeat() (#49160)
kou Feb 10, 2026
bc48921
GH-49208: [Ruby] Add support for writing dictionary delta message (#4…
kou Feb 11, 2026
2737d91
GH-43352: [Docs][Python] Add all tensor classes documentation (#49147)
chilin0525 Feb 11, 2026
6cce6be
GH-49187: [Doc] Fix versions.json for Arrow 1.0 (#49224)
AlenkaF Feb 11, 2026
e11aeee
GH-49217: [C++][Parquet] Fix map type to preserve key-value metadata …
wgtmac Feb 11, 2026
3fd809f
GH-44655: [C++][Python] Enable building AzureFileSystem in PyArrow wh…
nateprewitt Feb 12, 2026
2b9ed29
MINOR: [CI] Update Python CI to use Azure in Windows tests (#49240)
nateprewitt Feb 12, 2026
a444c87
MINOR: [CI] Set max-parallel=20 (#49228)
rok Feb 12, 2026
68d1368
GH-48862: [C++][Integration] Build arrow_c_data_integration library r…
raulcd Feb 12, 2026
4b19bf0
GH-49084: [CI][Dev] Wait for odbc-nightly before executing CPP extra …
raulcd Feb 12, 2026
9ed8fc1
GH-49184: [CI] AMD64 macOS 15-intel Python 3 consistently times out (…
tadeja Feb 12, 2026
bb78dff
GH-49233: [CI][Python] Update anaconda-client to 1.14.1 to support la…
nateprewitt Feb 12, 2026
a272046
GH-49225: [Ruby] Add support for writing dictionary delta for primiti…
kou Feb 12, 2026
29d34e8
GH-49234: [CI][Python] Nightly sdist job fails due to missing update_…
rok Feb 13, 2026
134638d
GH-49144: [R][CI] Get rid of unused CentOS CI job (#49239)
thisisnic Feb 13, 2026
e2aad46
GH-48576: [C++][FlightRPC] ODBC: add Mac setup script (#48578)
alinaliBQ Feb 15, 2026
1d76e1e
GH-49176: [C++] CRAN build fail on missing std::floating_point concep…
jonkeane Feb 16, 2026
4ba70c5
GH-49229: [C++] Fix abort when reading IPC file with a union validity…
pitrou Feb 16, 2026
f2aea05
MINOR: [Release] Update versions for 24.0.0-SNAPSHOT
raulcd Feb 16, 2026
63f20c4
MINOR: [Release] Update .deb/.rpm changelogs for 23.0.1
raulcd Feb 16, 2026
3e6988a
GH-48998: [R] Add note to docs on validating IPC streams (#48999)
thisisnic Feb 16, 2026
1114958
GH-49186: [R] Support dplyr::filter_out() in Arrow dplyr backend (#49…
larry77 Feb 16, 2026
76f7815
GH-49248: [Release] Include checksum in vote email (#49249)
amoeba Feb 17, 2026
7c45228
GH-48846: [C++] Read message metadata and body in one go in IPC file …
abhishek593 Feb 17, 2026
ebaaf07
GH-49146: [C++] Add option to disable atfork handlers (#49148)
pitrou Feb 17, 2026
c8e069d
GH-49307: [Benchmarks] Revert rig-based R installation in benchmark h…
thisisnic Feb 17, 2026
4a4718f
GH-49263: [Python][CI] Install rust compiler for libcst only on Debia…
raulcd Feb 17, 2026
aa564a9
GH-49169: [C++] Add ApplicationId to AzureFileSystem for SDK calls (#…
nateprewitt Feb 18, 2026
b6eb617
GH-49311: [C++][CI] Use differential fuzzing on IPC file fuzzer (#49312)
pitrou Feb 18, 2026
6c86961
GH-49318: [Ruby] Ensure using extpp 0.1.2 or later (#49319)
kou Feb 18, 2026
59e0ba6
GH-49299: [C++][Parquet] Integer overflow in Parquet dict decoding (#…
pitrou Feb 18, 2026
2691103
GH-48591: [C++] Remove some bit utils from bit_utils.h and replace th…
Anakin100100 Feb 18, 2026
6080027
GH-38184: [C++] Add systematic tests for Builder::AppendArraySlice (#…
abhishek593 Feb 18, 2026
b5eb42e
GH-49287: [C++][R] Clean up any other C++20 partial compatibility iss…
jonkeane Feb 18, 2026
9cf8f33
GH-49323: [R] Update NEWS.md for 23.0.1 (#49324)
thisisnic Feb 18, 2026
99984fd
GH-46531: [C++] Add type_singleton utility function and tests. (#47922)
harshkumar-2005 Feb 18, 2026
aea1ad3
GH-49325: [C++] Check if YMM register saving is OS enabled (#49326)
AntoinePrv Feb 18, 2026
031a2a4
GH-49316: [Ruby] Add support for auto dependency install for red-arro…
kou Feb 19, 2026
c20a04a
GH-35806: [R] Improve error message for null type inference with spar…
thisisnic Feb 19, 2026
99d1304
Revert "GH-35806: [R] Improve error message for null type inference w…
thisisnic Feb 19, 2026
8a16de6
GH-49330: [R] Update docs to reflect removal of OpenSSL 1.0 and 1.1 s…
thisisnic Feb 19, 2026
74026cd
GH-49292: [C++] Add support for armv8 or later (#49337)
kou Feb 20, 2026
ba61297
GH-48978: [Python] test failures on pandas 3.0 for fastparquet and fo…
tadeja Feb 20, 2026
b7ed8df
GH-41624: [C++] Add missing system Boost dependency to bundled Apache…
kou Feb 20, 2026
57473ad
GH-49314: [CI][Packaging][deb] Add support for minor/patch release in…
kou Feb 21, 2026
0cf32b2
GH-49114: [C++][Parquet] Fix converting schema failure with deep nest…
HuaHuaY Feb 21, 2026
8c27898
GH-49356: [C++] Remove deprecated APIs from v13.0.0 and v18.0.0 (#49171)
AliRana30 Feb 21, 2026
84a5701
GH-49364: [Ruby] Simplify reader tests (#49365)
kou Feb 22, 2026
61741aa
GH-48145: [R] Update to testthat 3.3.0 and use its expect_r6_class() …
thisisnic Feb 23, 2026
cbe2618
GH-49369: [C++][R] Deal with validating libtool again (#49370)
jonkeane Feb 24, 2026
8b50bb1
GH-49266: [C++][Parquet] Optimize delta bit-packed decoding when bit-…
pitrou Feb 24, 2026
376afb8
GH-49340: [R] Preserve row order in `write_dataset()` (#49343)
marberts Feb 24, 2026
f3f1eb0
GH-49341: [Packaging] Add support for Ubuntu 26.04 (#49345)
kou Feb 24, 2026
2fcc3ec
GH-38007: [C++] Add VariableShapeTensor implementation (#38008)
rok Feb 24, 2026
dc5b76e
GH-49389: [Ruby] Add support for custom metadata in field and schema …
kou Feb 26, 2026
924291b
GH-49393: [C++][FlightRPC][DOC] Add limitations to ODBC ReadMe Doc (#…
alinaliBQ Feb 26, 2026
8c0c4a8
GH-49293: [Packaging][deb] Update `debian/watch` for version 5 (#49294)
kou Feb 26, 2026
c9dfb3e
GH-49366: [CI][C++] Use system LLVM to use libstdc++ in gcc-toolset (…
kou Feb 26, 2026
a0c9a4b
GH-48277: [C++][Parquet] unpack with shuffle algorithm (#47994)
AntoinePrv Feb 26, 2026
879bbb3
GH-49400: [Ruby] Add `Arrow::FixedSizeList#values` and `#raw_records`…
kou Feb 27, 2026
007903c
GH-49371: [C++] Work around bit_width not being available on MacOS's …
jonkeane Feb 28, 2026
1cac811
GH-49406: [Ruby] Add support for fixed size list array (#49407)
kou Mar 1, 2026
aae49e8
GH-49417: [GLib] Add `garrow_map_data_type_is_keys_sorted()` (#49418)
kou Mar 2, 2026
0124d5b
GH-49078: [FS][Azure] Fix lossy pickling of `SubTreeFileSystem(base_p…
Tom-Newton Mar 2, 2026
96ecb13
GH-49424: [C++] Use std::bit_width instead of missing std::log2p1 on …
raulcd Mar 2, 2026
0bddf5d
GH-49415: [C++] Don't change map type key/item/value field names (#49…
kou Mar 2, 2026
693bcfe
MINOR: [CI] Bump actions/upload-artifact from 6 to 7 (#49430)
dependabot[bot] Mar 3, 2026
3ff230c
MINOR: [CI] Bump actions/download-artifact from 7 to 8 (#49431)
dependabot[bot] Mar 3, 2026
c48e81e
GH-48593: [C++] C++20: use standard calendar / timezone APIs (#48601)
rok Mar 3, 2026
cfbbf70
GH-49329: [C++][Parquet][CI] Add fuzz target for encoder/decoder roun…
pitrou Mar 3, 2026
c187333
GH-48241: [Python] Scalar inferencing doesn't infer UUID (#48727)
tadeja Mar 3, 2026
edfb24d
GH-49426: [Python] Do not build pyarrow-stubs on emscripten builds (#…
raulcd Mar 3, 2026
c61df29
GH-49428: [C++][Gandiva] Add support for LLVM 22.1.0 (#49429)
raulcd Mar 4, 2026
d42c7dd
GH-47279: [C++] Implement GetByteRangesArray for view types (#47418)
robert3005 Mar 4, 2026
65993cd
GH-49434: [C++][CI] Add golden integration files to IPC file fuzz cor…
pitrou Mar 4, 2026
0686130
GH-49382: [Python] Enable OpenTelemetry on PyArrow wheels (#49383)
raulcd Mar 4, 2026
855a4be
feat(gandiva): Add timestamp precision support infrastructure and ext…
telemenar Mar 18, 2026
938eaed
feat(gandiva): Add precision support for all extract functions
telemenar Mar 18, 2026
55b1c58
feat(gandiva): Add precision support for date_trunc functions
telemenar Mar 18, 2026
2163e0c
feat(gandiva): Add precision support for timestamp arithmetic functions
telemenar Mar 18, 2026
e6be364
feat(gandiva): Add precision support for cast and conversion functions
telemenar Mar 18, 2026
519aad5
feat(gandiva): Add remaining precision-aware datetime functions
telemenar Mar 18, 2026
The diff you're trying to view is too large. We only load the first 3000 changed files.
46 changes: 46 additions & 0 deletions .asf.yaml
@@ -0,0 +1,46 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

github:
  description: "Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics"
  homepage: https://arrow.apache.org/
  labels:
    - arrow
    - parquet

  features:
    issues: true
    projects: true
    discussions: true

  protected_branches:
    # protect main against force push
    main: {}

  collaborators:
    - alinaliBQ
    - EnricoMi
    - hiroyuki-sato
    - HyukjinKwon

notifications:
  commits: commits@arrow.apache.org
  issues_status: issues@arrow.apache.org
  issues: github@arrow.apache.org
  pullrequests: github@arrow.apache.org
  discussions: user@arrow.apache.org
  jira_options: link label worklog
23 changes: 23 additions & 0 deletions .clang-format
@@ -0,0 +1,23 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
---
BasedOnStyle: Google
ColumnLimit: 90
DerivePointerAlignment: false
IncludeBlocks: Preserve
IndentPPDirectives: AfterHash
QualifierAlignment: Left
38 changes: 38 additions & 0 deletions .clang-tidy
@@ -0,0 +1,38 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
---
Checks: |
  clang-diagnostic-*,
  clang-analyzer-*,
  -clang-analyzer-alpha*,
  google-*,
  modernize-*,
  -modernize-avoid-c-arrays,
  -modernize-use-trailing-return-type,
  -modernize-use-nodiscard,
# produce HeaderFilterRegex from cpp/build-support/lint_exclusions.txt with:
# echo -n '^('; sed -e 's/*/\.*/g' cpp/build-support/lint_exclusions.txt | tr '\n' '|'; echo ')$'
HeaderFilterRegex: '^(.*codegen.*|.*_generated.*|.*windows_compatibility.h|.*pyarrow_api.h|.*pyarrow_lib.h|.*python/config.h|.*python/platform.h|.*thirdparty/ae/.*|.*vendored/.*|.*RcppExports.cpp.*|)$'
CheckOptions:
  - key: google-readability-braces-around-statements.ShortStatementLines
    value: '1'
  - key: google-readability-function-size.StatementThreshold
    value: '800'
  - key: google-readability-namespace-comments.ShortNamespaceLines
    value: '10'
  - key: google-readability-namespace-comments.SpacesBeforeComments
    value: '2'
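
Note on the `HeaderFilterRegex` comment in `.clang-tidy` above: the shell pipeline it quotes replaces every `*` with `.*`, turns each newline into `|`, and wraps the result in `^(` and `)$`. A minimal Python sketch of the same transformation follows. The function name `build_header_filter_regex` is hypothetical and not part of the repository, and the sketch assumes the exclusions file holds one glob-like pattern per line with no comment lines.

```python
def build_header_filter_regex(patterns):
    """Mimic: echo -n '^('; sed -e 's/*/\\.*/g' FILE | tr '\\n' '|'; echo ')$'

    `patterns` is a list of glob-like lines (hypothetical input standing in
    for the contents of cpp/build-support/lint_exclusions.txt).
    """
    # sed step: every literal '*' becomes '.*'
    # tr step: every newline becomes '|', so each line ends with '|'
    body = "".join(line.replace("*", ".*") + "|" for line in patterns)
    return "^(" + body + ")$"


if __name__ == "__main__":
    print(build_header_filter_regex(["*codegen*", "*_generated*"]))
```

Because `tr` also converts the final newline, the generated alternation ends with an empty branch (`|)$`), which matches the trailing `|)$` in the `HeaderFilterRegex` value shown in the diff.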
File renamed without changes.
51 changes: 51 additions & 0 deletions .dockerignore
@@ -0,0 +1,51 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

# Turn .dockerignore to .dockerallow by excluding everything and explicitly
# allowing specific files and directories. This enables us to quickly add
# dependency files to the docker content without scanning the whole directory.
# This setup requires all of our docker containers to have arrow's source
# as a mounted directory.

# exclude everything
**

# include explicitly
!ci/**
!c_glib/Gemfile
!dev/release/setup-*.sh
!docs/requirements*.txt
!go/go.mod
!go/go.sum
!python/requirements*.txt
!r/DESCRIPTION
!ruby/Gemfile
!ruby/red-arrow/Gemfile
!ruby/red-arrow/lib/arrow/version.rb
!ruby/red-arrow/red-arrow.gemspec
!ruby/red-arrow-cuda/Gemfile
!ruby/red-arrow-cuda/lib/arrow-cuda/version.rb
!ruby/red-arrow-cuda/red-arrow-cuda.gemspec
!ruby/red-arrow-format/Gemfile
!ruby/red-arrow-format/lib/arrow-format/version.rb
!ruby/red-arrow-format/red-arrow-format.gemspec
!ruby/red-gandiva/Gemfile
!ruby/red-gandiva/lib/gandiva/version.rb
!ruby/red-gandiva/red-gandiva.gemspec
!ruby/red-parquet/Gemfile
!ruby/red-parquet/lib/parquet/version.rb
!ruby/red-parquet/red-parquet.gemspec
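
Note on the `.dockerignore` above: it excludes everything with `**` and then re-includes specific paths with `!` rules, and Docker lets the last matching rule decide. The sketch below illustrates that "exclude all, then allow" evaluation order in Python. It is an approximation under stated assumptions: `fnmatch` stands in for Docker's own matcher (its `*` also crosses `/`, unlike Docker's), `RULES` is a shortened subset of the file, and `is_sent_to_docker` is a hypothetical helper name.

```python
from fnmatch import fnmatchcase

# Shortened subset of the rules in the .dockerignore above (illustrative only).
RULES = ["**", "!ci/**", "!python/requirements*.txt", "!r/DESCRIPTION"]


def is_sent_to_docker(path, rules=RULES):
    """Return True if `path` would stay in the build context.

    Rules are evaluated top to bottom and the last matching rule wins;
    a leading '!' re-includes a path that an earlier rule excluded.
    """
    included = True  # with no matching rule, nothing is ignored
    for rule in rules:
        negated = rule.startswith("!")
        pattern = rule[1:] if negated else rule
        if fnmatchcase(path, pattern):
            included = negated
    return included


if __name__ == "__main__":
    for p in ["cpp/src/arrow/array.cc", "ci/docker/x.dockerfile", "r/DESCRIPTION"]:
        print(p, is_sent_to_docker(p))
```

With this ordering, adding a new dependency file to the build context only needs one extra `!` line, which is the convenience the file's comments describe.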
83 changes: 83 additions & 0 deletions .editorconfig
@@ -0,0 +1,83 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

# This is an EditorConfig file: https://editorconfig.org/

# This is the top-most config for this project
root = true

# General settings

[*]
insert_final_newline = true
trim_trailing_whitespace = true
charset = utf-8
spelling_language = en

# Language-specific settings, in approximate alphabetical order

[*.{c,cc,cpp,h,hh,hpp}]
indent_size = 2
indent_style = space

[*.cmake]
indent_size = 2
indent_style = space

[CMakeLists.txt]
indent_size = 2
indent_style = space

[{meson.build,meson_options.txt}]
indent_size = 4
indent_style = space

[*.cs]
indent_size = 4
indent_style = space

[*.{fbs,proto,thrift}]
indent_size = 2
indent_style = space

[*.{py,pyx,pxd,pxi}]
indent_size = 4
indent_style = space

[*.r]
indent_size = 2
indent_style = space

[*.rb]
indent_size = 2
indent_style = space

[*.rst]
indent_size = 3
indent_style = space

[*.sh]
indent_size = 2
indent_style = space

[*.vala]
indent_size = 4
indent_style = space

[*.{yml,yaml}]
indent_size = 2
indent_style = space
109 changes: 109 additions & 0 deletions .env
@@ -0,0 +1,109 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

# All of the following environment variables are required to set default values
# for the parameters in compose.yaml.

# An empty prefix means that the docker compose configuration uses named
# volumes, which can improve performance on Docker for macOS and Docker for
# Windows and also prevents contamination of the source directory.
# A non-empty prefix means that directories from the host are bind-mounted
# into the container. It should be set to ".docker/" on GitHub Actions to keep
# the cache plugin functional.
DOCKER_VOLUME_PREFIX=

# turn on inline build cache; this is a docker buildx feature documented
# at https://github.com/docker/buildx#--cache-tonametypetypekeyvalue
BUILDKIT_INLINE_CACHE=1
COMPOSE_DOCKER_CLI_BUILD=1
DOCKER_BUILDKIT=1

# different architecture notations
ARCH=amd64
ARCH_ALIAS=x86_64
ARCH_SHORT=amd64
# For aarch64
# ARCH=arm64v8
# ARCH_ALIAS=aarch64
# ARCH_SHORT=arm64

# Default repository to pull and push images from
REPO=apache/arrow-dev

# The setup attempts to generate coredumps by default. To disable coredump
# generation, set this to 0.
ULIMIT_CORE=-1

# Default versions for platforms
ALMALINUX=8
ALPINE_LINUX=3.22
DEBIAN=13
FEDORA=42
UBUNTU=22.04

# Default versions for various dependencies
CLANG_TOOLS=18
CMAKE=3.26.0
CUDA=11.7.1
DASK=latest
GCC=
HDFS=3.2.1
JDK=11
# LLVM 12 and GCC 11 report -Wmismatched-new-delete.
LLVM=18
MAVEN=3.8.7
NODE=20
NUMBA=latest
NUMBA_CUDA=latest
NUMPY=latest
PANDAS=latest
PYTHON=3.10
PYTHON_IMAGE_TAG=3.10
PYTHON_ABI_TAG=cp310
R=4.5
SPARK=master

# These correspond to images on Docker Hub that contain R, e.g. rhub/ubuntu-release:latest
R_IMAGE=ubuntu-release
R_ORG=rhub
R_TAG=latest

# Env vars for R builds
R_UPDATE_CLANG=false
R_CUSTOM_CCACHE=false
ARROW_R_DEV=TRUE
R_PRUNE_DEPS=FALSE
TZ=UTC

# Used through compose.yaml and serves as the default version for the
# ci/scripts/install_vcpkg.sh script. Prefer to use short SHAs to keep the
# docker tags more readable.
VCPKG="66c0373dc7fca549e5803087b9487edfe3aca0a1" # 2026.01.16 Release

# This must be updated when we update
# ci/docker/python-*-windows-*.dockerfile or the vcpkg config.
# This is a workaround for our CI problem that "archery docker build" doesn't
# use pulled built images in dev/tasks/python-wheels/github.windows.yml.
PYTHON_WHEEL_WINDOWS_IMAGE_REVISION=2026-02-25
PYTHON_WHEEL_WINDOWS_TEST_IMAGE_REVISION=2026-02-25

# Use conanio/${CONAN_BASE}:${CONAN_VERSION} for "docker compose run --rm conan".
# See https://github.com/conan-io/conan-docker-tools#readme and
# https://hub.docker.com/u/conanio for available images.
CONAN_BASE=gcc11-ubuntu16.04
CONAN_VERSION=2.12.1
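#
# A minimal sketch, assuming docker compose's documented behavior that
# variables set in the shell take precedence over the values in this file.
# The values shown are simply the defaults above; substitute another base
# or version to try a different image:
#   CONAN_BASE=gcc11-ubuntu16.04 CONAN_VERSION=2.12.1 \
#     docker compose run --rm conan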
15 changes: 15 additions & 0 deletions .gitattributes
@@ -0,0 +1,15 @@
cpp/src/arrow/util/bpacking_*_generated_internal.h linguist-generated=true
cpp/src/parquet/chunker_*_generated.h linguist-generated=true
cpp/src/generated/*.cpp linguist-generated=true
cpp/src/generated/*.h linguist-generated=true
go/**/*.s linguist-generated=true
go/arrow/unionmode_string.go linguist-generated=true
go/arrow/internal/flatbuf/*.go linguist-generated=true
go/**/*.pb.go linguist-generated=true
go/parquet/internal/gen-go/parquet/*.go linguist-generated=true
r/R/RcppExports.R linguist-generated=true
r/R/arrowExports.R linguist-generated=true
r/src/RcppExports.cpp linguist-generated=true
r/src/arrowExports.cpp linguist-generated=true
r/man/*.Rd linguist-generated=true
r/NEWS.md merge=union