Source position seems off #296

chriseidhof · 2019-04-24T21:56:55Z

Given the following input:

X
    **p**

The cmark library reports (through cmark_node_get_start_column) that the strong node starts at line 2, column 1, and ends at column 5. I would have expected it to start at column 5. Is this a bug, or is my interpretation of start column wrong?


honghaoz · 2019-04-24T22:11:25Z

Looks like this is related to the soft breaks.

jgm · 2019-04-25T14:27:20Z

I'll refer this to @kivikakk who added inline source positions in #228.

chriseidhof · 2019-04-25T18:35:15Z

Thank you. I added a passing test case in #297; hopefully that makes it easier to get started (or, if this is expected behavior, we can merge it as-is).

honghaoz · 2019-04-26T00:13:00Z

Some more information. Take Markdown like

A
  B

During the first pass, when the blocks are generated, the container block's string content already has the leading spaces skipped. So the container block node holds the text A\nB\n, not A\n B\n, which is probably why the source position is off for lazy continuation lines.

honghaoz · 2019-04-26T00:15:35Z

Related issue: #204

kivikakk · 2019-04-26T05:51:53Z

This issue becomes more pronouncedly odd when you try the inverse:

  A
B

produces

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE document SYSTEM "CommonMark.dtd">
<document sourcepos="1:1-2:1" xmlns="http://commonmark.org/xml/1.0">
  <paragraph sourcepos="1:3-2:1">
    <text sourcepos="1:3-1:3" xml:space="preserve">A</text>
    <softbreak />
    <text sourcepos="2:3-2:3" xml:space="preserve">B</text>
  </paragraph>
</document>

The lazy continuation means we end up treating everything as though it started at column 3.

honghaoz · 2019-04-26T19:04:33Z

@kivikakk I think that's because the first line has two leading spaces. If you change it to 3 or 1, you will see the start column of the following lines change accordingly.

I briefly went through the parsing logic:
https://github.com/commonmark/cmark/blob/master/src/blocks.c#L706 advances parser->offset and parser->column past the leading whitespace of the 2nd line.

https://github.com/commonmark/cmark/blob/master/src/blocks.c#L186-L187 then adds only the text after that offset to the container node.

For example:

  A
  B

The text of the paragraph node is changed from A\n B\n to A\nB\n
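To make the effect concrete, here is a minimal Python sketch of the behavior described above. It is not cmark's C code: the function names `strip_block_content` and `naive_position` are hypothetical, and it assumes only spaces (no tabs) as indentation. It strips leading spaces from every paragraph line, then computes an inline position as if every line began at the paragraph's starting column.

```python
def strip_block_content(source):
    """Mimic the block pass: drop the leading spaces of each paragraph line.

    Returns the paragraph's start column (1-based, taken from the first
    line) and the stripped content that an inline parser would see.
    """
    lines = source.split("\n")
    start_col = len(lines[0]) - len(lines[0].lstrip(" ")) + 1
    content = "\n".join(line.lstrip(" ") for line in lines)
    return start_col, content


def naive_position(start_col, content, needle):
    """Locate needle in the stripped content.

    Every line is assumed to start at start_col, which is the source of
    the wrong columns discussed in this thread.
    """
    offset = content.index(needle)
    line = content.count("\n", 0, offset) + 1
    line_start = content.rfind("\n", 0, offset) + 1
    return line, start_col + (offset - line_start)


start_col, content = strip_block_content("X\n    **p**")
print(naive_position(start_col, content, "**p**"))  # (2, 1), not (2, 5)
```

Under these assumptions the sketch reproduces both reports in this thread: column 1 for the strong node in the original input, and column 3 for B when only the first line is indented by two spaces.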

jgm · 2019-04-28T17:08:57Z

Seems to me that the inline parser should receive a list of starting columns for each input line, which it can use to adjust source positions. This would avoid the need to strip whitespace (as in #298) which imposes a performance penalty. This adds a bit of complexity, which is why I didn't implement inline source positions originally.
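A rough Python sketch of that idea, under the same caveats: this is not cmark code, `strip_with_columns` and `adjusted_position` are hypothetical names, and only space indentation is handled. The block pass records a starting column per line alongside the stripped content, and the inline position lookup adds the column of the line it lands on.

```python
def strip_with_columns(source):
    """Strip leading spaces per line, but remember where each line began.

    Returns a list of 1-based starting columns (one per input line) and
    the stripped content handed to the inline parser.
    """
    lines = source.split("\n")
    start_cols = [len(line) - len(line.lstrip(" ")) + 1 for line in lines]
    content = "\n".join(line.lstrip(" ") for line in lines)
    return start_cols, content


def adjusted_position(start_cols, content, needle):
    """Locate needle and correct its column using its own line's start."""
    offset = content.index(needle)
    line_idx = content.count("\n", 0, offset)
    line_start = content.rfind("\n", 0, offset) + 1
    return line_idx + 1, start_cols[line_idx] + (offset - line_start)


cols, content = strip_with_columns("X\n    **p**")
print(adjusted_position(cols, content, "**p**"))  # (2, 5)
```

The extra cost is one integer per line rather than a second pass over the text, which is the trade-off the comment above points at.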

bibo38 · 2023-01-13T22:24:11Z

I would also like to see this feature, as it would allow me to easily extract sections from a bigger markdown file. E.g. API descriptions, where I wanna parse the general format of the documentation (API call definitions, parameter tables), but leave the description in markdown to be processed later on (if even necessary).

chriseidhof mentioned this issue Apr 24, 2019

Two lines paragraph highlight issue objcio/markdown-playgrounds#9

Open

chriseidhof mentioned this issue Apr 25, 2019

[wip] Add test info for softbreak #297

Open

chriszielinski mentioned this issue May 8, 2019

Fix source positions for inlines. #298

Open

gaborcsardi mentioned this issue Jul 10, 2022

Inline rmarkdown results inserted in wrong place r-lib/roxygen2#1353

Closed
