Created August 27, 2012 15:32
19:01 <mst> http://paste.scsys.co.uk/206434
19:02 <mst> thoughts?
16:32 <ingy> my first thought is that SineSwiper is wrong in his assessment that parsing is better with lexing
16:32 <ingy> with a separate lexing step
16:33 <ingy> this is a major problem with coffeescript
16:33 <ingy> which I am currently trying to fix
16:35 <ingy> I think SS thinks that RecDescent means that you parse too deeply and waste time, but that is simply a matter of writing good grammars vs poor ones.
16:35 <ingy> My grammars are careful never to need much lookahead
16:36 <ingy> because I was thinking about that when I wrote them
16:36 <ingy> anyway, I think it's weird that he didn't talk to me about it
16:37 <ingy> the thing he should do is write some simple failing tests
16:37 <ingy> it is entirely possible that Pegex has flaws. almost certain in fact.
16:38 <ingy> but unless he can come up with test cases that I can't work around with good grammar writing, I am certainly not going to entertain lexing
... in #cdent
16:39 <@ingy> mst just made me aware of people griping that Pegex is not a Lex/Parse setup
16:39 <@ingy> and hi mst :)
16:40 <@ingy> Pegex does both lexing and parsing at the same time
16:41 <@ingy> so coffeescript does a full lex, then a lexical analysis, then finally a grammar parse
16:41 <@ingy> with lots of code in every stage
16:42 <@ingy> the pegex way is to do this all at once, and unless I'm sorely mistaken, Pegex parsing of CoffeeScript will be faster, and will also work around all the heinous corners that coffee has painted itself into
16:43 <@ingy> func() if bool # is required to be on one line in coffee
16:44 <@ingy> this sucks when that expression grows past 80 columns
16:45 <@ingy> but the lexer would assign a newline to be a terminator token, and the parser isn't expecting that
16:45 <@ingy> which is not to say that the parser can't be smarter
16:46 <@ingy> but in my experience requiring the lexing to be completely separate from the parsing throws away too much context, and leaves you having to invent ways to deal with the resulting problems
16:50 * sevvie smiles.
16:51 < sevvie> It just sounds like you lex well, to me.
16:53 < sevvie> It does bring up the question: what benefits does a balanced lexer and parser provide? (For all intents and purposes, I know nothing.)
16:54 <@ingy> mst: sevvie just made me realize that pegex could be set up to be "just a lexer"
16:54 <@ingy> I should have examples of this
16:55 <@ingy> doing it both ways
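The scannerless argument in the log above — that matching token regexes inline while parsing lets the grammar's context decide what a newline means — can be sketched in a few lines. This is a toy illustration in Python, not Pegex's actual API; the class and rule names are made up:

```python
import re

class ScannerlessParser:
    """Toy scannerless parser. Token regexes are applied at the current
    position *during* parsing, so grammar context decides whether a
    newline is plain whitespace or a statement terminator."""

    def __init__(self, text):
        self.text = text
        self.pos = 0

    def match(self, pattern):
        m = re.compile(pattern).match(self.text, self.pos)
        if m:
            self.pos = m.end()
            return m.group(0)
        return None

    def parse_statement(self):
        call = self.match(r'\w+\(\)')
        # While looking for a postfix-if, any whitespace (newlines
        # included) is insignificant -- so the `if` may sit on the
        # next line, unlike with a separate lexer that has already
        # stamped the newline as a TERMINATOR token:
        if self.match(r'[ \t\n]*if\b'):
            self.match(r'[ \t\n]*')
            cond = self.match(r'\w+')
            return ('if', cond, ('call', call))
        # No postfix-if follows, so here a newline *does* terminate:
        self.match(r'[ \t]*\n?')
        return ('call', call)

parser = ScannerlessParser("func()\n  if ready")
print(parser.parse_statement())  # ('if', 'ready', ('call', 'func()'))
```

A standalone lexer would have to tokenize the first newline before knowing whether an `if` follows; here the decision is simply deferred until the grammar asks for it.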
Yeah, sorry about not chatting about it earlier; thought you weren't online.
As a matter of context, I'm trying to take Pg's original SQL lexer/parser (flex/bison code) and convert it into Perl with its own interchange format. Because of the sheer size of the parsing code (545 grammar rules), this Pg parser project forces a balancing act between straight C-to-Perl conversion and writing entirely new code. I'm fine with wholesale conversions of the various types, and I had to convert huge batches of C code anyway. But if I start writing new code, I'm throwing away the work the Pg crew has already figured out for me, so I'm trying to keep that to a minimum.
So far, I've successfully converted the thing into a working Eyapp module. I used that because I didn't know any better, and because it was the closest Perl analogue to Bison/yacc anyway. There are still some bugs here and there, but it's good enough to start re-converting into a more modern parser like Pegex. (And hopefully Pegex doesn't compile things into 8MB .pm files, like Eyapp does.)
Right now I'm in the middle of converting the "Lexer" piece to Pegex, under the assumption that it will all end up as one piece. (I think you have "#include" support somewhere; otherwise, it'll just be one large *.pgx file.) My goal is to 'warp' the lexer so that the tokens the parser expects become grammar rules. That should at least reduce the time I'd need to spend messing with the parser itself, though I still have to figure out how the AST/Receiver modules will play out.
I'm finding that I can replace huge sections of "old skool" lexer code with more modern Perl REs. (For example, the old scan.l file relies on start/end state changes for lexing that can be replaced with a single start/content/end RE.) However, I don't think I'll get away with many of those kinds of optimizations on the parser, beyond maybe taking some of the large OR blocks and splitting them into subgroups.
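The start/content/end collapse described above can be shown with a small sketch. Python is used here purely so the regex behavior is easy to run; the pattern carries over to Perl almost verbatim. Note this covers only the non-nested case — Pg actually allows nested `/* */` comments, which a flat regex cannot match:

```python
import re

# A flex scanner typically handles /* ... */ by switching into an
# exclusive start condition on "/*", accumulating content rule by rule,
# and popping the state on "*/".  A single start/content/end regex
# collapses all of that into one match.  Non-greedy ".*?" ensures the
# first "*/" ends the comment; DOTALL lets "." cross newlines.
C_COMMENT = re.compile(r'/\*(.*?)\*/', re.DOTALL)

sql = "SELECT 1 /* a\n   multi-line\n   comment */ + 2"
body = C_COMMENT.search(sql).group(1)      # the comment's content
stripped = C_COMMENT.sub(' ', sql)         # SQL with comments blanked
```

One match replaces an entire state machine: the "start" is the literal `/\*`, the "content" is the captured lazy run, and the "end" is `\*/`.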
Anyway, if you could fill in the gaps on the Syntax POD (see pull request), that would be very helpful. I still can't figure out what those rule modifiers do.