diff: "lisp" userdiff_driver #2000

slburson · 2025-11-14T23:22:02Z

CC: Junio C Hamano [email protected], Johannes Sixt [email protected], Ævar Arnfjörð Bjarmason [email protected], Jaydeep P Das [email protected]

gitgitgadget · 2025-11-14T23:22:37Z

Welcome to GitGitGadget

Hi @slburson, and welcome to GitGitGadget, the GitHub App to send patch series to the Git mailing list from GitHub Pull Requests.

Please make sure that either:

Your Pull Request has a good description, if it consists of multiple commits, as it will be used as cover letter.
Your Pull Request description is empty, if it consists of a single commit, as the commit message should be descriptive enough by itself.

You can CC potential reviewers by adding a footer to the PR description with the following syntax:

CC: Revi Ewer <[email protected]>, Ill Takalook <[email protected]>

NOTE: DO NOT copy/paste your CC list from a previous GGG PR's description,
because it will result in a malformed CC list on the mailing list. See
example.

Also, it is a good idea to review the commit messages one last time, as the Git project expects them in a quite specific form:

the lines should not exceed 76 columns,
the first line should be like a header and typically start with a prefix like "tests:" or "revisions:" to state which subsystem the change is about, and
the commit messages' body should be describing the "why?" of the change.
Finally, the commit messages should end in a Signed-off-by: line matching the commits' author.

It is in general a good idea to await the automated test ("Checks") in this Pull Request before contributing the patches, e.g. to avoid trivial issues such as unportable code.

Contributing the patches

Before you can contribute the patches, your GitHub username needs to be added to the list of permitted users. Any already-permitted user can do that, by adding a comment to your PR of the form /allow. A good way to find other contributors is to locate recent pull requests where someone has been /allowed:

Search: is:pr is:open "/allow"

Both the person who commented /allow and the PR author are able to /allow you.

An alternative is the channel #git-devel on the Libera Chat IRC network:

<newcontributor> I've just created my first PR, could someone please /allow me? https://github.com/gitgitgadget/git/pull/12345
<veteran> newcontributor: it is done
<newcontributor> thanks!

Once on the list of permitted usernames, you can contribute the patches to the Git mailing list by adding a PR comment /submit.

If you want to see what email(s) would be sent for a /submit request, add a PR comment /preview to have the email(s) sent to you. You must have a public GitHub email address for this. Note that any reviewers CC'd via the list in the PR description will not actually be sent emails.

After you submit, GitGitGadget will respond with another comment that contains the link to the cover letter mail in the Git mailing list archive. Please make sure to monitor the discussion in that thread and to address comments and suggestions (while the comments and suggestions will be mirrored into the PR by GitGitGadget, you will still want to reply via mail).

If you do not want to subscribe to the Git mailing list just to be able to respond to a mail, you can download the mbox from the Git mailing list archive (click the (raw) link), then import it into your mail program. If you use GMail, you can do this via:

curl -g --user "<EMailAddress>:<Password>" \
    --url "imaps://imap.gmail.com/INBOX" -T /path/to/raw.txt

To iterate on your change, i.e. send a revised patch or patch series, you will first want to (force-)push to the same branch. You probably also want to modify your Pull Request description (or title). It is a good idea to summarize the revision by adding something like this to the cover letter (read: by editing the first comment on the PR, i.e. the PR description):

Changes since v1:
- Fixed a typo in the commit message (found by ...)
- Added a code comment to ... as suggested by ...
...

To send a new iteration, just add another PR comment with the contents: /submit.

Need help?

New contributors who want advice are encouraged to join [email protected], where volunteers who regularly contribute to Git are willing to answer newbie questions, give advice, or otherwise provide mentoring to interested contributors. You must join in order to post or view messages, but anyone can join.

You may also be able to find help in real time in the developer IRC channel, #git-devel on Libera Chat. Remember that IRC does not support offline messaging, so if you send someone a private message and log out, they cannot respond to you. The scrollback of #git-devel is archived, though.

slburson · 2025-11-14T23:29:44Z

@dscho Could I trouble you for a /allow?

The "scheme" driver doesn't quite work for Common Lisp. This driver is very generic and should work for almost any dialect of Lisp, including Common Lisp. Signed-off-by: Scott L. Burson <[email protected]>

dscho · 2025-11-15T06:46:32Z

/allow

gitgitgadget · 2025-11-15T06:47:26Z

User slburson is now allowed to use GitGitGadget.

slburson · 2025-11-15T09:15:01Z

@dscho Thanks!

slburson · 2025-11-15T09:15:08Z

/preview

gitgitgadget · 2025-11-15T09:16:15Z

Preview email sent as [email protected]

slburson · 2025-11-15T10:16:50Z

/submit

gitgitgadget · 2025-11-15T10:17:58Z

Submitted as [email protected]

To fetch this version into FETCH_HEAD:

git fetch https://github.com/gitgitgadget/git/ pr-2000/slburson/lisp-userdiff_driver-v1

To fetch this version to local tag pr-2000/slburson/lisp-userdiff_driver-v1:

git fetch --no-tags https://github.com/gitgitgadget/git/ tag pr-2000/slburson/lisp-userdiff_driver-v1

gitgitgadget · 2025-11-15T17:51:34Z

On the Git mailing list, Johannes Sixt wrote (reply to this):

[Cc the author of the Scheme driver]

Am 15.11.25 um 11:17 schrieb Scott L. Burson via GitGitGadget:
> From: "Scott L. Burson" <[email protected]>
> 
> The "scheme" driver doesn't quite work for Common Lisp.  This driver
> is very generic and should work for almost any dialect of Lisp,
> including Common Lisp.

"Doesn't quite work" is unfortunately a description of the problem that
does not help a reviewer decide whether this change is justified. Please
add a lot more details why the Scheme driver is unsuitable for Lisp and
why a new driver is needed.

It is customary to mark changes to the drivers in the subject line with
"userdiff:". Have a look at `git log userdiff.c`. It would be
appreciated to stay away from nerdy tokens like "userdiff_driver" when
the change can be summarized in plain English language.

> 
> Signed-off-by: Scott L. Burson <[email protected]>
> ---
>     diff: "lisp" userdiff_driver
> 
> Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-2000%2Fslburson%2Flisp-userdiff_driver-v1
> Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-2000/slburson/lisp-userdiff_driver-v1
> Pull-Request: https://github.com/gitgitgadget/git/pull/2000
> 
>  userdiff.c | 8 ++++++++
>  1 file changed, 8 insertions(+)
> 
> diff --git a/userdiff.c b/userdiff.c
> index fe710a68bf..e127b4a1f1 100644
> --- a/userdiff.c
> +++ b/userdiff.c
> @@ -249,6 +249,14 @@ PATTERNS("kotlin",
>  	 "|[.][0-9][0-9_]*([Ee][-+]?[0-9]+)?[fFlLuU]?"
>  	 /* unary and binary operators */
>  	 "|[-+*/<>%&^|=!]==?|--|\\+\\+|<<=|>>=|&&|\\|\\||->|\\.\\*|!!|[?:.][.:]"),
> +PATTERNS("lisp",
> +	 /* Either an unindented left paren, or a slightly indented line
> +	  * starting with "(def" */
> +	 "^((\\(|:space:{1,2}\\(def).*)$",

Compared to the Scheme driver, this regular expression is

- more restrictive because it does not permit arbitrary indentation;

- less restrictive because it permits everything that begins with "(def".

What would happen if this regular expression were added to the Scheme
driver? Would it pick up additional and unwanted hunk headers is typical
Scheme code?

Just so you know: The string literal for hunk headers can contain "\n"
to separate regular expressions. For a given line, they are attempted to
match in order until the first match is found.

> +	 /* Common Lisp symbol syntax allows arbitrary strings between vertical bars */
> +	 "\\|([^\\\\]|\\\\\\\\|\\\\\\|)*\\|"

The Scheme driver has an similar description of this word token, but it
has only half as many backslashes. Is the difference necessary? Isn't
actually one or the other incorrect? (I did not try to understand what
this version here does.)

> +	 /* All other words are delimited by spaces or parentheses/brackets/braces */
> +	 "|([^][(){} \t])+"),

This one looks identical to the same line in Scheme with a slightly
altered comment.

In summary, I feel like this could be unified with the Scheme driver,
except when awkward false positive hunk headers in existing Scheme code
are to be expected. That's a question that I cannot answer.

-- Hannes

gitgitgadget · 2025-11-15T23:40:46Z

On the Git mailing list, "Scott L. Burson" wrote (reply to this):

On Sat, Nov 15, 2025 at 9:06 AM Johannes Sixt <[email protected]> wrote:
>
> [Cc the author of the Scheme driver]
>
> Am 15.11.25 um 11:17 schrieb Scott L. Burson via GitGitGadget:
> > From: "Scott L. Burson" <[email protected]>
>
> Please
> add a lot more details why the Scheme driver is unsuitable for Lisp and
> why a new driver is needed.

Here is text I propose for the commit message:

----
Common Lisp has top-level forms 'defun' and 'deftype' that are not
matched by the current Scheme pattern.  Also, it is more common when
defining user macros intended as top-level forms to prefix their names
with "def" instead of "define"; such forms are also not matched.  And
some such forms don't even begin with "def".

On the other hand, it is an established formatting convention in the
Lisp community that only top-level forms start at the left margin.  So
matching any unindented line starting with an open parenthesis is an
acceptable heuristic; false positives will be rare.

However, there are also cases where notionally top-level forms are
grouped together within some containing form.  At least in the Common
Lisp community, it is conventional to indent these by two spaces, or
sometimes one.  But matching just an open parenthesis indented by two
spaces would be too broad; so the pattern added by this commit
requires an indented form to start with "(def".  It is believed that
this strikes a good balance between potential false positives and
false negatives.
----

I discussed the pattern with some other experienced Common Lisp
developers on a mailing list, and this is what I settled on after
incorporating their feedback.

> It is customary to mark changes to the drivers in the subject line with
> "userdiff:". Have a look at `git log userdiff.c`. It would be
> appreciated to stay away from nerdy tokens like "userdiff_driver" when
> the change can be summarized in plain English language.

Will do.

> >
> > Signed-off-by: Scott L. Burson <[email protected]>
> > ---
> >     diff: "lisp" userdiff_driver
> >
> > Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-2000%2Fslburson%2Flisp-userdiff_driver-v1
> > Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-2000/slburson/lisp-userdiff_driver-v1
> > Pull-Request: https://github.com/gitgitgadget/git/pull/2000
> >
> >  userdiff.c | 8 ++++++++
> >  1 file changed, 8 insertions(+)
> >
> > diff --git a/userdiff.c b/userdiff.c
> > index fe710a68bf..e127b4a1f1 100644
> > --- a/userdiff.c
> > +++ b/userdiff.c
> > @@ -249,6 +249,14 @@ PATTERNS("kotlin",
> >        "|[.][0-9][0-9_]*([Ee][-+]?[0-9]+)?[fFlLuU]?"
> >        /* unary and binary operators */
> >        "|[-+*/<>%&^|=!]==?|--|\\+\\+|<<=|>>=|&&|\\|\\||->|\\.\\*|!!|[?:.][.:]"),
> > +PATTERNS("lisp",
> > +      /* Either an unindented left paren, or a slightly indented line
> > +       * starting with "(def" */
> > +      "^((\\(|:space:{1,2}\\(def).*)$",
>
> Compared to the Scheme driver, this regular expression is
>
> - more restrictive because it does not permit arbitrary indentation;
>
> - less restrictive because it permits everything that begins with "(def".
>
> What would happen if this regular expression were added to the Scheme
> driver? Would it pick up additional and unwanted hunk headers is typical
> Scheme code?

That is a good question.  I don't think so, but I don't work in Scheme.
I see that you have CC'ed Atharva Raykar; let's see whether he would
have any objection.

I would point out that Scheme is a dialect of Lisp, not the other way
around.  (Lisp is unusual in being a family of languages, rather than a
single language.)  And having a separate "lisp" driver might aid
discoverability.

But I understand: Scheme got their driver in first, and you have to fight
against the tendency of the driver list to grow unboundedly.

Ooh, that reminds me: if we do decide to add a "lisp" driver, I'll also need
to add it to 'Documentation/gitattributes.adoc'.

> The string literal for hunk headers can contain "\n"

Noted.

> > +      /* Common Lisp symbol syntax allows arbitrary strings between vertical bars */
> > +      "\\|([^\\\\]|\\\\\\\\|\\\\\\|)*\\|"
>
> The Scheme driver has an similar description of this word token, but it
> has only half as many backslashes. Is the difference necessary? Isn't
> actually one or the other incorrect? (I did not try to understand what
> this version here does.)

It's not important, but technically, Common Lisp allows an escaped
backslash between vertical bars, but the R7RS formal grammar does not.
However, I just tried Chicken Scheme, which claims to be at least
partially R7RS compliant, and it does accept the escaped backslash.  I
am left to conclude that Scheme implementors think that the omission
of the escaped backslash from the R7RS formal grammar is an oversight
(I think so too).

Of course, no one would actually write a symbol name with an escaped
backslash in it unless they were submitting to an obfuscated Lisp
contest.  So we are really being pedantic here.  Still, may as well allow
it.

Atharva, any comments?

-- Scott

gitgitgadget · 2025-11-16T05:42:17Z

On the Git mailing list, Junio C Hamano wrote (reply to this):

Johannes Sixt <[email protected]> writes:

>> +	 /* Either an unindented left paren, or a slightly indented line
>> +	  * starting with "(def" */
>> +	 "^((\\(|:space:{1,2}\\(def).*)$",
>
> Compared to the Scheme driver, this regular expression is
>
> - more restrictive because it does not permit arbitrary indentation;
>
> - less restrictive because it permits everything that begins with "(def".
>
> What would happen if this regular expression were added to the Scheme
> driver? Would it pick up additional and unwanted hunk headers is typical
> Scheme code?

As we generally assume that the file being edited is syntactically
sound, even if one lisp variant understands "(deffoo" and others do
not, it should be generally fine for the pattern to say something
like "at the beginning of the line, optionally following a few
spaces, four-letter sequence '(def' is likely to be the beginning of
a function definition", as long as there is some convention that
user defined functions and macros, unless they are to behave
similarly to "(defun", would not be named so confusingly to start
with d-e-f.

It would be nice if a single set of rules can cover what existing
scheme patterns cover, Emacs lisp, and Common lisp.

gitgitgadget bot added the new user label Nov 14, 2025

diff: "lisp" userdiff_driver

da99bb0

The "scheme" driver doesn't quite work for Common Lisp. This driver is very generic and should work for almost any dialect of Lisp, including Common Lisp. Signed-off-by: Scott L. Burson <[email protected]>

slburson force-pushed the lisp-userdiff_driver branch from ccf37a5 to da99bb0 Compare November 14, 2025 23:30

slburson changed the title ~~"lisp" userdiff_driver~~ diff: "lisp" userdiff_driver Nov 14, 2025

diff: "lisp" userdiff_driver #2000

Are you sure you want to change the base?

diff: "lisp" userdiff_driver #2000

Uh oh!

Conversation

slburson commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gitgitgadget bot commented Nov 14, 2025

Welcome to GitGitGadget

Contributing the patches

Need help?

Uh oh!

slburson commented Nov 14, 2025

Uh oh!

dscho commented Nov 15, 2025

Uh oh!

gitgitgadget bot commented Nov 15, 2025

Uh oh!

slburson commented Nov 15, 2025

Uh oh!

slburson commented Nov 15, 2025

Uh oh!

gitgitgadget bot commented Nov 15, 2025

Uh oh!

slburson commented Nov 15, 2025

Uh oh!

gitgitgadget bot commented Nov 15, 2025

Uh oh!

gitgitgadget bot commented Nov 15, 2025

Uh oh!

gitgitgadget bot commented Nov 15, 2025

Uh oh!

gitgitgadget bot commented Nov 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

slburson commented Nov 14, 2025 •

edited

Loading