Class LookaheadCharInputReader
java.lang.Object
com.univocity.parsers.common.input.LookaheadCharInputReader
- All Implemented Interfaces:
CharInput,CharInputReader
A special implementation of
CharInputReader that wraps another CharInputReader and
collects a sequence of characters from the wrapped input, in order to analyze what the buffer contains
ahead of the current position.-
Constructor Summary
ConstructorsConstructorDescriptionLookaheadCharInputReader(CharInputReader reader, char newLine, int whitespaceRangeStart) Creates a lookahead input reader by wrapping a givenCharInputReaderimplementation -
Method Summary
Modifier and TypeMethodDescriptionlongReturns the number of characters returned byCharInputReader.nextChar()at any given time.Returns a String with the input character sequence parsed to produce the current record.intReturns the length of the character sequence parsed to produce the current record.voidenableNormalizeLineEndings(boolean escaping) Indicates to the input reader that the parser is running in "escape" mode and new lines should be returned as-is to prevent modifying the content of the parsed value.final chargetChar()Returns the last character returned by theCharInputReader.nextChar()method.char[]Returns the line separator by this character input reader.Returns the current lookahead value.getLookahead(char current) Returns the lookahead value prepended with the current charactergetQuotedString(char quote, char escape, char escapeEscape, int maxLength, char stop1, char stop2, boolean keepQuotes, boolean keepEscape, boolean trimLeading, boolean trimTrailing) Attempts to collect a quotedStringfrom the current position until a closing quote or stop character is found on the input, or a line ending is reached.Attempts to collect aStringfrom the current position until a stop character is found on the input, or a line ending is reached.intlastIndexOf(char ch) Returns the last index of a given character in the current parsed contentlongReturns the number of newlines read so far.voidlookahead(int numberOfCharacters) Fills the lookahead buffer with a given number of characters that will be extracted from the wrappedCharInputReadervoidMarks the start of a new record in the input, used internally to calculate the result ofCharInputReader.currentParsedContent()booleanmatches(char[] sequence, char wildcard) Matches a sequence of characters against the current lookahead buffer.booleanmatches(char current, char[] sequence, char wildcard) Matches a sequence of characters against the current lookahead buffer.charnextChar()Returns the next character in the input provided by the activeReader.Collects the comment line found on the input.voidvoidskipLines(long lineCount) Skips characters in the input until the given number of lines is discarded.booleanskipQuotedString(char quote, char escape, char stop1, char stop2) Attempts to skip a quotedStringfrom the current position until a stop character is found on the input, or a line ending is reached.booleanskipString(char ch, char stop) Attempts to skip aStringfrom the current position until a stop character is found on the input, or a line ending is reached.charskipWhitespace(char ch, char stopChar1, char stopChar2) Skips characters from the current input position, until a non-whitespace character, or a stop character is foundvoidInitializes the CharInputReader implementation with aReaderwhich provides access to the input.voidstop()Stops the CharInputReader from reading characters from theReaderprovided inCharInputReader.start(Reader)and closes it.
-
Constructor Details
-
LookaheadCharInputReader
Creates a lookahead input reader by wrapping a givenCharInputReaderimplementation- Parameters:
reader- the input reader whose characters will read and stored in a limited internal buffer, in order to allow a parser to query what the characters are available ahead of the current input position.newLine- the normalized character that represents a line ending. Used internally as a stop character.whitespaceRangeStart- starting range of characters considered to be whitespace.
-
-
Method Details
-
matches
public boolean matches(char current, char[] sequence, char wildcard) Matches a sequence of characters against the current lookahead buffer.- Parameters:
current- the last character used by the parser, which should match the first character in the lookahead buffersequence- the expected sequence of characters after the current character, that are expected appear in the current lookahead bufferwildcard- character used in the sequence as a wildcard (e.g. * or ?), meaning any character is acceptable in its place.- Returns:
trueif the current character and the sequence characters that follows are present in the lookahead, otherwisefalse
-
matches
public boolean matches(char[] sequence, char wildcard) Matches a sequence of characters against the current lookahead buffer.- Parameters:
sequence- the expected sequence of characters that are expected appear in the current lookahead bufferwildcard- character used in the sequence as a wildcard (e.g. * or ?), meaning any character is acceptable in its place.- Returns:
trueif the given sequence of characters is present in the lookahead, otherwisefalse
-
getLookahead
Returns the current lookahead value.- Returns:
- the current lookahead value, or an empty
Stringif the lookahead buffer is empty.
-
getLookahead
Returns the lookahead value prepended with the current character- Parameters:
current- the current character obtained by the parser- Returns:
- a
Stringformed by the given character followed by the lookahead value (if any).
-
lookahead
public void lookahead(int numberOfCharacters) Fills the lookahead buffer with a given number of characters that will be extracted from the wrappedCharInputReader- Parameters:
numberOfCharacters- the number of characters to read from the wrappedCharInputReader, given in the constructor of this class.
-
start
Description copied from interface:CharInputReaderInitializes the CharInputReader implementation with aReaderwhich provides access to the input.- Specified by:
startin interfaceCharInputReader- Parameters:
reader- AReaderthat provides access to the input.
-
stop
public void stop()Description copied from interface:CharInputReaderStops the CharInputReader from reading characters from theReaderprovided inCharInputReader.start(Reader)and closes it.- Specified by:
stopin interfaceCharInputReader
-
nextChar
public char nextChar()Description copied from interface:CharInputReaderReturns the next character in the input provided by the activeReader.If the input contains a sequence of newline characters (defined by
Format.getLineSeparator()), this method will automatically converted them to the newline character specified inFormat.getNormalizedNewline().A subsequent call to this method will return the character after the newline sequence.
- Specified by:
nextCharin interfaceCharInput- Specified by:
nextCharin interfaceCharInputReader- Returns:
- the next character in the input. '\0' if there are no more characters in the input or if the CharInputReader was stopped.
-
charCount
public long charCount()Description copied from interface:CharInputReaderReturns the number of characters returned byCharInputReader.nextChar()at any given time.- Specified by:
charCountin interfaceCharInputReader- Returns:
- the number of characters returned by
CharInputReader.nextChar()
-
lineCount
public long lineCount()Description copied from interface:CharInputReaderReturns the number of newlines read so far.- Specified by:
lineCountin interfaceCharInputReader- Returns:
- the number of newlines read so far.
-
skipLines
public void skipLines(long lineCount) Description copied from interface:CharInputReaderSkips characters in the input until the given number of lines is discarded.- Specified by:
skipLinesin interfaceCharInputReader- Parameters:
lineCount- the number of lines to skip from the current location in the input
-
enableNormalizeLineEndings
public void enableNormalizeLineEndings(boolean escaping) Description copied from interface:CharInputReaderIndicates to the input reader that the parser is running in "escape" mode and new lines should be returned as-is to prevent modifying the content of the parsed value.- Specified by:
enableNormalizeLineEndingsin interfaceCharInputReader- Parameters:
escaping- flag indicating that the parser is escaping values and line separators are to be returned as-is.
-
readComment
Description copied from interface:CharInputReaderCollects the comment line found on the input.- Specified by:
readCommentin interfaceCharInputReader- Returns:
- the text found in the comment from the current position.
-
getLineSeparator
public char[] getLineSeparator()Description copied from interface:CharInputReaderReturns the line separator by this character input reader. This could be the line separator defined in theFormat.getLineSeparator()configuration, or the line separator sequence identified automatically whenCommonParserSettings.isLineSeparatorDetectionEnabled()evaluates totrue.- Specified by:
getLineSeparatorin interfaceCharInputReader- Returns:
- the line separator in use.
-
getChar
public final char getChar()Description copied from interface:CharInputReaderReturns the last character returned by theCharInputReader.nextChar()method.- Specified by:
getCharin interfaceCharInput- Specified by:
getCharin interfaceCharInputReader- Returns:
- the last character returned by the
CharInputReader.nextChar()method.'\0' if there are no more characters in the input or if the CharInputReader was stopped.
-
skipWhitespace
public char skipWhitespace(char ch, char stopChar1, char stopChar2) Description copied from interface:CharInputReaderSkips characters from the current input position, until a non-whitespace character, or a stop character is found- Specified by:
skipWhitespacein interfaceCharInputReader- Parameters:
ch- the current character of the inputstopChar1- the first stop character (which can be a whitespace)stopChar2- the second character (which can be a whitespace)- Returns:
- the first non-whitespace character (or delimiter) found in the input.
-
currentParsedContent
Description copied from interface:CharInputReaderReturns a String with the input character sequence parsed to produce the current record.- Specified by:
currentParsedContentin interfaceCharInputReader- Returns:
- the text content parsed for the current input record.
-
markRecordStart
public void markRecordStart()Description copied from interface:CharInputReaderMarks the start of a new record in the input, used internally to calculate the result ofCharInputReader.currentParsedContent()- Specified by:
markRecordStartin interfaceCharInputReader
-
getString
Description copied from interface:CharInputReaderAttempts to collect aStringfrom the current position until a stop character is found on the input, or a line ending is reached. If theStringcan be obtained, the current position of the parser will be updated to the last consumed character. If the internal buffer needs to be reloaded, this method will returnnulland the current position of the buffer will remain unchanged.- Specified by:
getStringin interfaceCharInputReader- Parameters:
ch- the current character to be considered. If equal to the stop character thenullValuewill be returnedstop- the stop character that identifies the end of the content to be collectedtrim- flag indicating whether or not trailing whitespaces should be discardednullValue- value to return when the length of the content to be returned is0.maxLength- the maximum length of theStringto be returned. If the length exceeds this limit,nullwill be returned- Returns:
- the
Stringfound on the input, ornullif the buffer needs to reloaded or the maximum length has been exceeded.
-
getQuotedString
public String getQuotedString(char quote, char escape, char escapeEscape, int maxLength, char stop1, char stop2, boolean keepQuotes, boolean keepEscape, boolean trimLeading, boolean trimTrailing) Description copied from interface:CharInputReaderAttempts to collect a quotedStringfrom the current position until a closing quote or stop character is found on the input, or a line ending is reached. If theStringcan be obtained, the current position of the parser will be updated to the last consumed character. If the internal buffer needs to be reloaded, this method will returnnulland the current position of the buffer will remain unchanged.- Specified by:
getQuotedStringin interfaceCharInputReader- Parameters:
quote- the quote characterescape- the quote escape characterescapeEscape- the escape of the quote escape charactermaxLength- the maximum length of theStringto be returned. If the length exceeds this limit,nullwill be returnedstop1- the first stop character that identifies the end of the content to be collectedstop2- the second stop character that identifies the end of the content to be collectedkeepQuotes- flag to indicate the quotes that wrap the resultingStringshould be kept.keepEscape- flag to indicate that escape sequences should be kepttrimLeading- flag to indicate leading whitespaces should be trimmedtrimTrailing- flag to indicate that trailing whitespaces should be trimmed- Returns:
- the
Stringfound on the input, ornullif the buffer needs to reloaded or the maximum length has been exceeded.
-
currentParsedContentLength
public int currentParsedContentLength()Description copied from interface:CharInputReaderReturns the length of the character sequence parsed to produce the current record.- Specified by:
currentParsedContentLengthin interfaceCharInputReader- Returns:
- the length of the text content parsed for the current input record
-
skipString
public boolean skipString(char ch, char stop) Description copied from interface:CharInputReaderAttempts to skip aStringfrom the current position until a stop character is found on the input, or a line ending is reached. If theStringcan be skipped, the current position of the parser will be updated to the last consumed character. If the internal buffer needs to be reloaded, this method will returnfalseand the current position of the buffer will remain unchanged.- Specified by:
skipStringin interfaceCharInputReader- Parameters:
ch- the current character to be considered. If equal to the stop characterfalsewill be returnedstop- the stop character that identifies the end of the content to be collected- Returns:
trueif an entireStringvalue was found on the input and skipped, orfalseif the buffer needs to reloaded.
-
skipQuotedString
public boolean skipQuotedString(char quote, char escape, char stop1, char stop2) Description copied from interface:CharInputReaderAttempts to skip a quotedStringfrom the current position until a stop character is found on the input, or a line ending is reached. If theStringcan be skipped, the current position of the parser will be updated to the last consumed character. If the internal buffer needs to be reloaded, this method will returnfalseand the current position of the buffer will remain unchanged.- Specified by:
skipQuotedStringin interfaceCharInputReader- Parameters:
quote- the quote characterescape- the quote escape characterstop1- the first stop character that identifies the end of the content to be collectedstop2- the second stop character that identifies the end of the content to be collected- Returns:
trueif an entireStringvalue was found on the input and skipped, orfalseif the buffer needs to reloaded.
-
lastIndexOf
public int lastIndexOf(char ch) Description copied from interface:CharInputReaderReturns the last index of a given character in the current parsed content- Specified by:
lastIndexOfin interfaceCharInputReader- Parameters:
ch- the character to look for- Returns:
- the last position of the given character in the current parsed content, or
-1if not found.
-
reloadBuffer
public void reloadBuffer()
-