pub struct JsonArrayToNdjsonReader<R: Read> { /* private fields */ }Expand description
A streaming reader that converts JSON array format to NDJSON format.
This reader wraps an underlying reader containing JSON array data
[{...}, {...}, ...] and transforms it on-the-fly to newline-delimited
JSON format that Arrow’s JSON reader can process.
Implements both Read and BufRead traits for compatibility with Arrow’s
ReaderBuilder::build() which requires BufRead.
§Transformation Rules
- Skip leading
[and whitespace before it - Convert top-level
,(between objects) to\n - Skip whitespace at top level (between objects)
- Stop at trailing
] - Preserve everything inside objects (including nested
[,],,) - Properly handle strings (ignore special chars inside quotes)
§Example
Input: [{"a":1}, {"b":[1,2]}, {"c":"x,y"}]
Output: {"a":1}
{"b":[1,2]}
{"c":"x,y"}Implementations§
Source§impl<R: Read> JsonArrayToNdjsonReader<R>
impl<R: Read> JsonArrayToNdjsonReader<R>
Sourcepub fn new(reader: R) -> Self
pub fn new(reader: R) -> Self
Create a new streaming reader that converts JSON array to NDJSON.
Sourcepub fn with_capacity(reader: R, capacity: usize) -> Self
pub fn with_capacity(reader: R, capacity: usize) -> Self
Create a new streaming reader with custom buffer size.
Larger buffers improve throughput but use more memory. Total memory usage is approximately 2 * capacity (input + output buffers).
Sourcepub fn validate_complete(&self) -> Result<()>
pub fn validate_complete(&self) -> Result<()>
Check if the JSON array was properly terminated.
This should be called after all data has been read.
Returns an error if:
- Unbalanced braces/brackets (depth != 0)
- Unterminated string
- Missing closing
] - Unexpected trailing content after
]
Trait Implementations§
Source§impl<R: Read> BufRead for JsonArrayToNdjsonReader<R>
impl<R: Read> BufRead for JsonArrayToNdjsonReader<R>
Source§fn fill_buf(&mut self) -> Result<&[u8]>
fn fill_buf(&mut self) -> Result<&[u8]>
Returns the contents of the internal buffer, filling it with more data, via
Read methods, if empty. Read moreSource§fn consume(&mut self, amt: usize)
fn consume(&mut self, amt: usize)
Marks the given
amount of additional bytes from the internal buffer as having been read.
Subsequent calls to read only return bytes that have not been marked as read. Read moreSource§fn has_data_left(&mut self) -> Result<bool, Error>
fn has_data_left(&mut self) -> Result<bool, Error>
🔬This is a nightly-only experimental API. (
buf_read_has_data_left)Checks if there is any data left to be
read. Read more1.83.0 · Source§fn skip_until(&mut self, byte: u8) -> Result<usize, Error>
fn skip_until(&mut self, byte: u8) -> Result<usize, Error>
Skips all bytes until the delimiter
byte or EOF is reached. Read more1.0.0 · Source§fn read_line(&mut self, buf: &mut String) -> Result<usize, Error>
fn read_line(&mut self, buf: &mut String) -> Result<usize, Error>
Reads all bytes until a newline (the
0xA byte) is reached, and append
them to the provided String buffer. Read moreSource§impl<R: Read> Read for JsonArrayToNdjsonReader<R>
impl<R: Read> Read for JsonArrayToNdjsonReader<R>
Source§fn read(&mut self, buf: &mut [u8]) -> Result<usize>
fn read(&mut self, buf: &mut [u8]) -> Result<usize>
Pull some bytes from this source into the specified buffer, returning
how many bytes were read. Read more
1.36.0 · Source§fn read_vectored(&mut self, bufs: &mut [IoSliceMut<'_>]) -> Result<usize, Error>
fn read_vectored(&mut self, bufs: &mut [IoSliceMut<'_>]) -> Result<usize, Error>
Like
read, except that it reads into a slice of buffers. Read moreSource§fn is_read_vectored(&self) -> bool
fn is_read_vectored(&self) -> bool
🔬This is a nightly-only experimental API. (
can_vector)1.0.0 · Source§fn read_to_end(&mut self, buf: &mut Vec<u8>) -> Result<usize, Error>
fn read_to_end(&mut self, buf: &mut Vec<u8>) -> Result<usize, Error>
Reads all bytes until EOF in this source, placing them into
buf. Read more1.0.0 · Source§fn read_to_string(&mut self, buf: &mut String) -> Result<usize, Error>
fn read_to_string(&mut self, buf: &mut String) -> Result<usize, Error>
Reads all bytes until EOF in this source, appending them to
buf. Read more1.6.0 · Source§fn read_exact(&mut self, buf: &mut [u8]) -> Result<(), Error>
fn read_exact(&mut self, buf: &mut [u8]) -> Result<(), Error>
Reads the exact number of bytes required to fill
buf. Read moreSource§fn read_buf(&mut self, buf: BorrowedCursor<'_>) -> Result<(), Error>
fn read_buf(&mut self, buf: BorrowedCursor<'_>) -> Result<(), Error>
🔬This is a nightly-only experimental API. (
read_buf)Pull some bytes from this source into the specified buffer. Read more
Source§fn read_buf_exact(&mut self, cursor: BorrowedCursor<'_>) -> Result<(), Error>
fn read_buf_exact(&mut self, cursor: BorrowedCursor<'_>) -> Result<(), Error>
🔬This is a nightly-only experimental API. (
read_buf)Reads the exact number of bytes required to fill
cursor. Read more1.0.0 · Source§fn by_ref(&mut self) -> &mut Selfwhere
Self: Sized,
fn by_ref(&mut self) -> &mut Selfwhere
Self: Sized,
Creates a “by reference” adapter for this instance of
Read. Read more1.0.0 · Source§fn chain<R>(self, next: R) -> Chain<Self, R>
fn chain<R>(self, next: R) -> Chain<Self, R>
Creates an adapter which will chain this stream with another. Read more
Auto Trait Implementations§
impl<R> Freeze for JsonArrayToNdjsonReader<R>where
R: Freeze,
impl<R> RefUnwindSafe for JsonArrayToNdjsonReader<R>where
R: RefUnwindSafe,
impl<R> Send for JsonArrayToNdjsonReader<R>where
R: Send,
impl<R> Sync for JsonArrayToNdjsonReader<R>where
R: Sync,
impl<R> Unpin for JsonArrayToNdjsonReader<R>where
R: Unpin,
impl<R> UnsafeUnpin for JsonArrayToNdjsonReader<R>where
R: UnsafeUnpin,
impl<R> UnwindSafe for JsonArrayToNdjsonReader<R>where
R: UnwindSafe,
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more