class Parser::Source::Buffer
A buffer with source code. {Buffer} contains the source code itself, associated location information (name and first line), and takes care of encoding.
A source buffer is immutable once populated.
@!attribute [r] name
Buffer name. If the buffer was created from a file, the name corresponds to relative path to the file. @return [String] buffer name
@!attribute [r] first_line
First line of the buffer, 1 by default. @return [Integer] first line
@api public
Constants
- ENCODING_RE
@api private
Attributes
Public Class Methods
# File lib/parser/source/buffer.rb, line 105 def initialize(name, first_line = 1, source: nil) @name = name.to_s @source = nil @first_line = first_line @lines = nil @line_begins = nil # UTF-32-reencoded source for O(1) slicing @slice_source = nil # Cache for fast lookup @line_index_for_position = {} self.source = source if source end
Try to recognize encoding of ‘string` as Ruby would, i.e. by looking for magic encoding comment or UTF-8 BOM. `string` can be in any encoding.
@param [String] string @return [String, nil] encoding name, if recognized
# File lib/parser/source/buffer.rb, line 51 def self.recognize_encoding(string) return if string.empty? # extract the first two lines in an efficient way string =~ /\A(.*)\n?(.*\n)?/ first_line, second_line = $1, $2 if first_line.start_with?("\xef\xbb\xbf".freeze) # BOM return Encoding::UTF_8 elsif first_line[0, 2] == '#!'.freeze encoding_line = second_line else encoding_line = first_line end return nil if encoding_line.nil? || encoding_line[0] != '#' if (result = ENCODING_RE.match(encoding_line)) Encoding.find(result[3] || result[4] || result[6]) else nil end end
Recognize encoding of ‘input` and process it so it could be lexed.
* If `input` does not contain BOM or magic encoding comment, it is kept in the original encoding. * If the detected encoding is binary, `input` is kept in binary. * Otherwise, `input` is re-encoded into UTF-8 and returned as a new string.
This method mutates the encoding of ‘input`, but not its content.
@param [String] input @return [String] @raise [EncodingError]
# File lib/parser/source/buffer.rb, line 90 def self.reencode_string(input) original_encoding = input.encoding detected_encoding = recognize_encoding(input.force_encoding(Encoding::BINARY)) if detected_encoding.nil? input.force_encoding(original_encoding) elsif detected_encoding == Encoding::BINARY input else input. force_encoding(detected_encoding). encode(Encoding::UTF_8) end end
Public Instance Methods
Convert a character index into the source to a column number.
@param [Integer] position @return [Integer] column @api private
# File lib/parser/source/buffer.rb, line 233 def column_for_position(position) line_index = line_index_for_position(position) position - line_begins[line_index] end
Convert a character index into the source to a ‘[line, column]` tuple.
@param [Integer] position @return [[Integer, Integer]] ‘[line, column]`
# File lib/parser/source/buffer.rb, line 208 def decompose_position(position) line_index = line_index_for_position(position) line_begin = line_begins[line_index] [ @first_line + line_index , position - line_begin ] end
Number of last line in the buffer
@return [Integer]
# File lib/parser/source/buffer.rb, line 298 def last_line line_begins.size + @first_line - 2 end
Convert a character index into the source to a line number.
@param [Integer] position @return [Integer] line @api private
# File lib/parser/source/buffer.rb, line 222 def line_for_position(position) line_index_for_position(position) + @first_line end
Extract line ‘lineno` as a new `Range`, taking `first_line` into account.
@param [Integer] lineno @return [Range] @raise [IndexError] if ‘lineno` is out of bounds
# File lib/parser/source/buffer.rb, line 275 def line_range(lineno) index = lineno - @first_line if index < 0 || index + 1 >= line_begins.size raise IndexError, 'Parser::Source::Buffer: range for line ' \ "#{lineno} requested, valid line numbers are #{@first_line}.." \ "#{@first_line + line_begins.size - 2}" else Range.new(self, line_begins[index], line_begins[index + 1] - 1) end end
Populate this buffer from a string without encoding autodetection.
@param [String] input @raise [ArgumentError] if already populated @return [String]
# File lib/parser/source/buffer.rb, line 180 def raw_source=(input) if @source raise ArgumentError, 'Source::Buffer is immutable' end @source = input.gsub("\r\n".freeze, "\n".freeze).freeze if !@source.ascii_only? && @source.encoding != Encoding::UTF_32LE && @source.encoding != Encoding::BINARY @slice_source = @source.encode(Encoding::UTF_32LE) end end
Populate this buffer from correspondingly named file.
@example
Parser::Source::Buffer.new('foo/bar.rb').read
@return [Buffer] self @raise [ArgumentError] if already populated
# File lib/parser/source/buffer.rb, line 131 def read File.open(@name, 'rb') do |io| self.source = io.read end self end
# File lib/parser/source/buffer.rb, line 194 def slice(range) if @slice_source.nil? @source[range] else @slice_source[range].encode(@source.encoding) end end
Source
code contained in this buffer.
@return [String] source code @raise [RuntimeError] if buffer is not populated yet
# File lib/parser/source/buffer.rb, line 145 def source if @source.nil? raise RuntimeError, 'Cannot extract source from uninitialized Source::Buffer' end @source end
Populate this buffer from a string with encoding autodetection. ‘input` is mutated if not frozen.
@param [String] input @raise [ArgumentError] if already populated @raise [EncodingError] if ‘input` includes invalid byte sequence for the encoding @return [String]
# File lib/parser/source/buffer.rb, line 162 def source=(input) input = input.dup if input.frozen? input = self.class.reencode_string(input) unless input.valid_encoding? raise EncodingError, "invalid byte sequence in #{input.encoding.name}" end self.raw_source = input end
Extract line ‘lineno` from source, taking `first_line` into account.
@param [Integer] lineno @return [String] @raise [IndexError] if ‘lineno` is out of bounds
# File lib/parser/source/buffer.rb, line 264 def source_line(lineno) source_lines.fetch(lineno - @first_line).dup end
Return an ‘Array` of source code lines.
@return [Array<String>]
# File lib/parser/source/buffer.rb, line 243 def source_lines @lines ||= begin lines = @source.lines.to_a lines << ''.dup if @source.end_with?("\n".freeze) lines.each do |line| line.chomp!("\n".freeze) line.freeze end lines.freeze end end
@return [Range] A range covering the whole source
# File lib/parser/source/buffer.rb, line 289 def source_range @source_range ||= Range.new(self, 0, source.size) end