| [R] read_ipc_stream fails to unify nested uint64 fields inside a Struct array across record batches |
Issue |
R |
2026-07-02 |
2 |
[R] col_types ignored when convert_options specified |
Issue |
R |
2025-11-16 |
230 |
| [R] Mixed-type list columns fail with unintuitive error message |
Issue |
R |
2025-08-27 |
311 |
| [Python] read_csv converts strings with leading zeros to integers |
Issue |
Python, Documentation |
2025-06-18 |
381 |
| [Python][C++] Update type definition of npy_traits for Float16 to arrow::util::Float16 instead of uint16_t |
Issue |
C++, Python |
2025-06-04 |
395 |
[CI][Crossbow][Dev] Continue generating nightlies dashboard for CI: Extra scheduled runs |
Issue |
Continuous Integration, Developer Tools |
2025-06-02 |
397 |
| [C++][Parquet] Make error reporting more detailed |
Issue |
Parquet, C++ |
2025-05-27 |
403 |
Consider adding CITATION.cff for citation details |
Issue |
Other |
2025-05-16 |
414 |
[C++] Review arrow/json headers for internal APIs |
Issue |
C++ |
2025-05-15 |
415 |
[C++] Review arrow/csv headers for internal APIs |
Issue |
C++ |
2025-05-15 |
415 |
| [C++][Parquet] Review headers for internal APIs |
Issue |
Parquet, C++ |
2025-05-15 |
415 |
| [FlightSQL][Docs] Document current state of FlightSQL support |
Issue |
FlightRPC, Documentation |
2025-05-07 |
423 |
| [Python] Support pyarrow.Table.cast with CastOptions |
Issue |
Python |
2025-04-14 |
446 |
| [Python] Expose testing data generation utility |
Issue |
Python |
2025-04-07 |
453 |
| [C++][Docs] C++ documentation for constructors is missing |
Issue |
Documentation |
2025-03-19 |
472 |
| [C++] Statistics Schema Implementation |
Issue |
C++, Python, Documentation |
2025-03-15 |
476 |
| [C++][Compute] Add FunctionOptions::Validate |
Issue |
C++ |
2025-03-13 |
478 |
[R] Don’t construct arrow_binary class vector in favor of blob::blob |
Issue |
R |
2025-03-08 |
483 |
| [R] creating arrow supported expressions |
Issue |
R |
2025-02-05 |
514 |
| [Ruby] Improve JRuby Support |
Issue |
Ruby |
2025-01-21 |
529 |
| [R] arrow R package: multiple replacement disclaimers for str_replace_all |
Issue |
R |
2025-01-20 |
530 |
| [Parquet][C++] PageIndex is useless with current API |
Issue |
Parquet, C++ |
2025-01-16 |
534 |
| [C++][Python] Implement pc.equal for List arguments |
Issue |
C++, Python |
2025-01-04 |
546 |
| [Docs] Clean up LICENSE and NOTICE files |
Issue |
Documentation |
2024-11-28 |
583 |
| [R] Provide helpful hints for NotImplemented kernel errors |
Issue |
R |
2024-11-15 |
596 |
| [C++][Python] Potential improvements around supply chain security |
Issue |
C++, Python |
2024-11-09 |
602 |
| [Python] Efficient way to iterate over groups |
Issue |
Python |
2024-11-07 |
604 |
| [R] Add Cumsum and duplicated bindings to datasets in R |
Issue |
R |
2024-11-07 |
604 |
| [CI][Packaging][Python] Enable BuildKit for building wheel on Windows |
Issue |
Python, Continuous Integration, Packaging |
2024-11-07 |
604 |
[Dev][Archery] Use --arrow-ref instead of --arrow-sha in archery crossbow submit |
Issue |
Archery, Developer Tools |
2024-10-31 |
611 |
| [Format][Docs] Describe C device interface on C data interface and C stream interface docs pages |
Issue |
Documentation, Format |
2024-10-26 |
616 |
| Misleading error message when casting |
Issue |
Python |
2024-10-25 |
617 |
| [CI][C++] Use a separated Docker image for Emscripten |
Issue |
C++, Continuous Integration |
2024-10-18 |
624 |
| [CI][C++] Add clang-cl job |
Issue |
C++ |
2024-10-11 |
631 |
[GLib] Add a sub Buffer class for GBytes based buffer |
Issue |
GLib |
2024-10-09 |
633 |
| [C++] Binary View Compute Kernels |
Issue |
C++ |
2024-10-08 |
634 |
| [R] please write unregister_scalar_function and/or make registration local/temporary |
Issue |
R |
2024-10-04 |
638 |
[Python] Allow PyCapsule Interface in pyarrow.scalar constructor? |
Issue |
Python |
2024-09-25 |
647 |
| [R] Support integer date and time classes from data.table |
Issue |
R |
2024-09-19 |
653 |
| [C++][Parquet] Add support for arrow::ArrayStatistics |
Issue |
Parquet, C++ |
2024-08-04 |
699 |
| [R] Subtracting X days from a given date in ymd format |
Issue |
R |
2024-08-01 |
702 |
| [Python] Move tests that are explicitly about conversion to/from numpy on test_array.py to a separate file |
Issue |
Python |
2024-08-01 |
702 |
| [R] Implement anonymous functions in calls to dplyr::across |
Issue |
R |
2024-07-10 |
724 |
| [CI][Dev] Add shell script formatter |
Issue |
Continuous Integration, Developer Tools |
2024-06-28 |
736 |
| [C++] Add support for system mimalloc |
Issue |
C++ |
2024-06-19 |
745 |
| [Python] Get size of IPC File ahead of time |
Issue |
Python |
2024-06-07 |
757 |
| [Python] Conversion to/from numpy 2.0+ new StringDType |
Issue |
Python |
2024-06-06 |
758 |
| [Docs][Format] Move IPC format spec back into a separate page |
Issue |
Documentation, Format |
2024-05-15 |
780 |
| [C++][Python] Update DLPack version |
Issue |
C++, Python |
2024-05-15 |
780 |
| [C++][Parquet] Predicate pushdown through arrow::dataset::ScanBuilder::Filter() not available on list fields |
Issue |
Parquet, C++ |
2024-05-14 |
781 |
| [R] Unable to disable url-encoding |
Issue |
R |
2024-05-10 |
785 |
| [Python] Pyarrow fs incorrectly resolves S3 URIs with white space as a local path |
Issue |
Python |
2024-04-24 |
801 |
| [CI][Archery] Archery linking should also check for undefined symbols Windows |
Issue |
Archery |
2024-04-03 |
822 |
| [CI][Archery] Archery linking should also check for undefined symbols macOS |
Issue |
Archery |
2024-04-03 |
822 |
| [CI][Archery] Archery linking should also check for undefined symbols Linux |
Issue |
Archery |
2024-04-03 |
822 |
| [Ruby] Improve Ruby’s GC integration |
Issue |
Ruby |
2024-03-29 |
827 |
| [Python] Allow pyarrow import to fail without triggering Py_FatalError |
Issue |
Python |
2024-03-27 |
829 |
| [Parquet] Make default fallback encoding choice smarter |
Issue |
Parquet, C++ |
2024-03-18 |
838 |
| [Python][Docs] Max batch size for Dataset |
Issue |
Python, Documentation |
2024-03-15 |
841 |
| [C++] Update vendored FlatBuffers to 24 |
Issue |
C++ |
2024-03-14 |
842 |
| [C++] String manipulation on a dictionary column |
Issue |
C++ |
2024-03-08 |
848 |
| [Python] Consider splitting _lib module into several parts |
Issue |
Python |
2024-02-20 |
865 |
| [Docs] Add a doc section for tensor arrays |
Issue |
Documentation |
2024-02-08 |
877 |
| [R] Export functions for low-level pointer operations |
Issue |
R |
2024-01-25 |
891 |
| [R] Update the docs to show how to avoid situations like data loss with leading zero in partition column |
Issue |
R |
2024-01-17 |
899 |
| [C++][Python] Floordiv compute kernel |
Issue |
C++, Python |
2023-12-29 |
918 |
| [Python] Add timezone information when printing TimestampArray |
Issue |
R, Python |
2023-12-20 |
927 |
| [Python] Clean up ExtensionType.__reduce__ |
Issue |
Python |
2023-12-06 |
941 |
| [C++][Parquet] Parquet: support exact in Page/Row-Group level Statistics |
Issue |
Parquet, C++ |
2023-11-23 |
954 |
[R] Write metadata to parquet file as argument to write_parquet() |
Issue |
R |
2023-11-19 |
958 |
| [R][Documentation] Document add_filename on open_dataset help page |
Issue |
R, Documentation |
2023-11-18 |
959 |
| [R] preserve hive partitions when opening along a path / path vector |
Issue |
R, C++, Python |
2023-11-15 |
962 |
| [Python] Support serialization of Arrow files on disk without the identifier “Feather” |
Issue |
Python |
2023-10-30 |
978 |
| [R] Inconsistent naming |
Issue |
R |
2023-10-25 |
983 |
| [Integration] Test non-zero offsets in C Data Interface |
Issue |
Integration |
2023-10-19 |
989 |
| [R][Docs] Add section on debugging S3 in the R developer docs |
Issue |
R, Documentation |
2023-10-14 |
994 |
| [R] open_dataset - format is unclear |
Issue |
R |
2023-10-10 |
998 |
[R] expose decimal_point argument in CSVConvertOptions |
Issue |
R |
2023-10-03 |
1005 |
| [R] open_dataset() behavior with incorrectly quoted input data |
Issue |
R |
2023-09-27 |
1011 |
| [C++] Implement REE support in ArrayFromJSONString |
Issue |
C++ |
2023-08-23 |
1046 |
| [R] Error passing data to/from DuckDB - “NotImplemented: Call to R (SafeRecordBatchReader::ReadNext()) from a non-R thread from an unsupported context” |
Issue |
R |
2023-08-22 |
1047 |
| GH-36831: [C++] DictionaryArray support for MinMax Function |
PR |
C++ |
2023-08-10 |
1059 |
| Missing kernels for ordering with struct types |
Issue |
Python |
2023-08-09 |
1060 |
| [C++][Parquet] Process parquet rowgroups without Arrow conversion |
Issue |
Parquet, C++, Python |
2023-05-17 |
1144 |
| [C++] Why is arrow mmap marked MAP_PRIVATE (during read)? |
Issue |
C++ |
2023-04-25 |
1166 |
| [Python] Bindings for FixedShapeTensorType.FromTensor/ToTensor and FixedShapeTensorArray.strides |
Issue |
Python |
2023-04-12 |
1179 |
| [Python][Docs] Update/rearrange Data Types section and add FixedShapeTensorType |
Issue |
Python, Documentation |
2023-04-12 |
1179 |
[R] Add an argument to open_csv_dataset() to repair duplicated column names or ignore them? |
Issue |
R |
2023-04-07 |
1184 |
| [Python] unexpected URL encoded path (white spaces) when uploading to S3 |
Issue |
Python |
2023-04-05 |
1186 |
| [R][Python] Expand coverage of and align R/Python to C++ CSV WriteOptions |
Issue |
R, Python |
2023-03-15 |
1207 |
| [Format][FlightRPC] Transfer FlightData in pieces |
Issue |
FlightRPC, Format |
2023-03-07 |
1215 |
| [R] Named lists cannot be serialized to a map column |
Issue |
R |
2023-03-02 |
1220 |
| [C++] Create the first binary aggregate function kernel to serve as an example for other implementations |
Issue |
C++ |
2023-01-30 |
1251 |
| [R] writing/reading a data.frame with column class ‘list’ changes column class |
Issue |
R |
2023-01-19 |
1262 |
[R] read_csv_arrow()’s timestamp_parsers parameter is a bit light on documentation and doesn’t appear to do anything |
Issue |
R |
2023-01-16 |
1265 |
| [C++][Python] Support parsing a StringArray full of JSON to a Table |
Issue |
C++, Python |
2023-01-13 |
1268 |
| [R] feather round-trip support for named vectors in list columns |
Issue |
R |
2022-12-19 |
1293 |
| [Python] test_get_include failing in conda builds on unix |
Issue |
Python, Continuous Integration |
2022-12-18 |
1294 |
| [Release] Changelog.md on master branch has not been updated since 6.0.1 |
Issue |
Developer Tools, Release |
2022-12-16 |
1296 |
| [Dev] Comment bot embeds closes comment on code block |
Issue |
Developer Tools |
2022-12-15 |
1297 |
| [R] Filter operations not shown when called before summarise |
Issue |
R |
2022-11-25 |
1317 |
| [C++] Decide on duplicate column handling in scanner, add more tests |
Issue |
C++ |
2022-11-22 |
1320 |
| [C++] Add a “list_contains” kernel |
Issue |
C++ |
2022-10-19 |
1354 |
| [C++][Python] Allow an ExtensionType to register or implement custom casts |
Issue |
C++, Python |
2022-09-29 |
1374 |
[R] Update make_date, make_datetime, ISOdate and ISOdatetime to use tz |
Issue |
R |
2022-07-12 |
1453 |
| “Edit this page” on docstring generated docs gives 404 |
Issue |
Documentation |
2022-06-10 |
1485 |
| [R] printing data in Table/RecordBatch print method |
Issue |
R, Python |
2022-06-07 |
1488 |
| [R] Integer overflow causes error - (in dplyr we get an NA with a warning) |
Issue |
R |
2022-05-03 |
1523 |
| [Archery] Add documentation for local development in archery/crossbow |
Issue |
Developer Tools |
2022-04-12 |
1544 |
| [Python] Version=7.0.0 introduces bug when filtering by empty set during load |
Issue |
Python |
2022-03-28 |
1559 |
| [R] Arrow/Parquet can’t open encrypted parquet files |
Issue |
R |
2022-01-26 |
1620 |
| [C++][Python] Slicing a table with no columns returns a table with incorrect length. |
Issue |
C++, Python |
2022-01-22 |
1624 |
| [Python][Docs] Opening a partitioned dataset with schema and filter |
Issue |
Python, Documentation |
2022-01-12 |
1634 |
| [C++] Name the threads in thread pools |
Issue |
C++ |
2022-01-07 |
1639 |
| [R] Implement bindings for stringr’s combining strings functions |
Issue |
R |
2021-11-24 |
1683 |
| [C++][R]Opening a multi-file dataset and writing a re-partitioned version of it fails |
Issue |
R, C++ |
2021-11-17 |
1690 |
| [C++][Dataset] Change scanner readahead limits to be based on bytes instead of number of batches |
Issue |
C++ |
2021-11-09 |
1698 |
| [C++][Dataset] Devise a mechanism to limit the total “system ram” (process + cache) used by dataset writes |
Issue |
C++ |
2021-11-08 |
1699 |
| [Docs] [Benchmarking] Add conbench to the benchmarking docs |
Issue |
Documentation, Benchmarking |
2021-11-03 |
1704 |
| [C++][R] Inconsistent application of type in Datasets via the schema |
Issue |
R, C++ |
2021-10-14 |
1724 |
| [R] Selecting colums while reading Parquet file with nested types can give wrong column |
Issue |
R |
2021-10-11 |
1727 |
| [R] Support inequality joins |
Issue |
R |
2021-10-08 |
1730 |
| [Python] Support other interval types |
Issue |
Python |
2021-10-07 |
1731 |
| [R] Empty character attributes not stored |
Issue |
R |
2021-08-09 |
1790 |
| [R] Throw helpful errors on bad object types in dplyr expressions |
Issue |
R |
2021-03-25 |
1927 |
| [Python][Dataset] The first table schema becomes a common schema for the full Dataset |
Issue |
Python, Documentation |
2021-03-24 |
1928 |
| [R] Support for Tensor class |
Issue |
R |
2021-02-15 |
1965 |
| [C++][Dataset] Provide more robust handling of comparison guarantees in the presence of implicit casts |
Issue |
C++ |
2021-02-08 |
1972 |
| [R] ChunkedArray$create assumes all chunks are the same type |
Issue |
R |
2021-01-11 |
2000 |
| [Ruby] Table#initialize examples are out of date |
Issue |
Ruby, Documentation |
2020-11-14 |
2058 |
| [C++][Python] .take silently overflow on list array (when casting to large_list is needed) |
Issue |
C++, Python |
2020-11-04 |
2068 |
| [Python] Schema Evolution - Add new Field |
Issue |
Python |
2020-09-08 |
2125 |
| [C++][Python] pa.array raises for mixed scalar types (float16 + int) |
Issue |
Python |
2020-08-21 |
2143 |
| [Python] Array.__str__ shows misleading output for timestamp types with time zone set |
Issue |
C++, Python |
2020-07-19 |
2176 |
| [C++/Python] Kernel for SetItem(IntegerArray, values) (“replace_with_indices”) |
Issue |
C++, Python |
2020-07-13 |
2182 |
| [Python][Dataset] Detect and use _metadata file in a list of file paths |
Issue |
Python |
2020-04-14 |
2272 |
| [C++][Dataset] Handling of duplicate columns in Dataset factory and scanning |
Issue |
C++ |
2020-03-25 |
2292 |
| [Crossbow] Eliminate libgit2 dependency |
Issue |
Developer Tools |
2020-03-12 |
2305 |
| [C++][Python] ArrowIOError: Invalid Parquet file size is 0 bytes on reading from S3 |
Issue |
C++, Python |
2020-02-16 |
2330 |
| [Python] consistently handle conversion of all-NaN arrays across types |
Issue |
Python |
2019-09-12 |
2487 |
| [Python] Array equals returns incorrectly if NaNs are in arrays |
Issue |
Python |
2019-07-25 |
2536 |
| [C++][Python] Support ExtensionType arrays in more kernels |
Issue |
C++ |
2019-07-09 |
2552 |
| [Doc] Better document the Tensor classes in the prose documentation |
Issue |
C++, Python, Documentation |
2019-07-04 |
2557 |
| [GLib] Add support for arrow::DictionaryBuilder |
Issue |
GLib |
2019-03-16 |
2667 |
| [Python] Add documentation section for integrations with PyTorch, TensorFlow |
Issue |
Python, Documentation |
2018-02-01 |
3075 |