patches and low-level development discussion
 help / color / mirror / code / Atom feed
From: Demi Marie Obenour <demiobenour@gmail.com>
To: Alyssa Ross <hi@alyssa.is>
Cc: Spectrum OS Development <devel@spectrum-os.org>
Subject: Re: [PATCH v5] Generate file lists from a script
Date: Sat, 27 Sep 2025 04:42:02 -0400	[thread overview]
Message-ID: <d10cfc57-0b59-4ee7-9c6a-7b326adb764a@gmail.com> (raw)
In-Reply-To: <87tt0ob93x.fsf@alyssa.is>


[-- Attachment #1.1.1: Type: text/plain, Size: 7397 bytes --]

On 9/27/25 04:19, Alyssa Ross wrote:
> Demi Marie Obenour <demiobenour@gmail.com> writes:
> 
>> Right now, the makefiles in host/rootfs, vm/sys/net, and img/app have
>> manually-maintained lists of files and symlinks.  These duplicate the
>> information in the git repository and can easily get out of sync or
>> cause unnecessary merge conflicts.  Fix all of these issues by having
>> the git repository be the source of truth, and using a script to
>> generate the file lists.  Developers can regenerate the lists before
>> every commit, or even add a git hook to do that.
>>
>> Signed-off-by: Demi Marie Obenour <demiobenour@gmail.com>
>> ---
>> This actually reduces the amount of code that has to be written by hand.
>> ---
> 
> This is so close.  Was almost at the point where I could have just fixed
> it up myself and committed it, but there's one thing I want to check
> with you.

Thank you for being careful!

>> Changes in v5:
>> - Use 'print ""' instead of 'print' in awk to print a newline.  'print'
>>   with no arguments implicitly prints $0 instead.  This caused the
>>   generated makefiles to be incorrect.
>> - Use S6_RC_FILES instead of VM_S6_RC_FILES in vm/sys/net/Makefile and
>>   img/app/Makefile.  This prevented the image from being built.
>> - Do not check for git repository being in a directory with a name
>>   ending in a newline.
>> - Use shell redirection instead of awk redirection.
>> - Do not include trailing DONE line in input to awk.
>> - Link to v4: https://lore.kernel.org/r/20250921-genfiles-v4-1-4375bda78707@gmail.com
>>
>> Changes in v4:
>> - Use /bin/sh instead of bash.
>> - Do not assume that negated awk character classes match all bytes.
>> - Do not check the mode of license files.
>> - Use implicit awk variable initialization.
>> - Use 'git rev-parse --show-toplevel' to find the repository root.
>> - Remove wrongly added copyright header.
>> - Improve documentation.
>> - Remove git hooks.
>> - Add missing copyright header.
>> - Avoid non-portable /usr/bin/env -S.
>> - Avoid assuming that awk is GNU awk.
>> - Avoid non-portable awk -E.
>> - Do not check for git bugs.
>> - Fix link in v3 changelog.
>> - Link to v3: https://spectrum-os.org/lists/archives/spectrum-devel/20250920-genfiles-v3-1-d6c2b6767b42@gmail.com
>>
>> Changes in v3:
>> - Only include the file list generator.  Move the rest to separate patch
>>   series.
>> - Remove the update-file-list make targets from img/app/Makefile and
>>   vm/sys/net/Makefile.
>> - Link to v2: https://spectrum-os.org/lists/archives/spectrum-devel/20250910-genfiles-v2-0-37ebe07a3cdc@gmail.com
>>
>> Changes in v2:
>> - Drop the last patch (switching to /etc/s6-rc/compiled) as it is
>>   controversial and should be reviewed separately.
>> - Add missing copyright notices.
>> - Use a wrapper shell script to make the awk code easier to read.
>> - Improve documentation.
>> - Add helper scripts for use in git hooks and rebasing.
>> - Link to v1: https://spectrum-os.org/lists/archives/spectrum-devel/20250903-genfiles-v1-0-cc993fcb1e4c@gmail.com/
>> ---
>>  Documentation/development/built-in-vms.adoc |   7 ++
>>  host/rootfs/Makefile                        | 102 +---------------------------
>>  host/rootfs/file-list.mk                    |  99 +++++++++++++++++++++++++++
>>  img/app/Makefile                            |  82 ++++------------------
>>  img/app/file-list.mk                        |  65 ++++++++++++++++++
>>  scripts/genfiles.awk                        |  77 +++++++++++++++++++++
>>  scripts/genfiles.sh                         |  24 +++++++
>>  vm/sys/net/Makefile                         |  52 +++-----------
>>  vm/sys/net/file-list.mk                     |  42 ++++++++++++
>>  9 files changed, 337 insertions(+), 213 deletions(-)
>>
>> diff --git a/host/rootfs/file-list.mk b/host/rootfs/file-list.mk
>> new file mode 100644
>> index 0000000000000000000000000000000000000000..58cda39f85f720ab46f025bc72f1a98f108f1c25
>> --- /dev/null
>> +++ b/host/rootfs/file-list.mk
>> @@ -0,0 +1,99 @@
>> +# SPDX-License-Identifier: EUPL-1.2+
>> +# SPDX-FileCopyrightText: 2021-2024 Alyssa Ross <hi@alyssa.is>
> 
> Only 2021.  The only thing I think is even arguably copyrightable is the
> comment.  And following that principle let's actually put your copyright
> in here too.

I'd rather just drop the comment.  It can be phrased better anyway,
and it is much cleaner for the generated file to only contain data.

>> +# Generated by scripts/genfile.sh.  Any changes will be overwritten.
> 
> Let's have a blank line before and after this comment, for readability.
> 
>> diff --git a/scripts/genfiles.awk b/scripts/genfiles.awk
>> new file mode 100644
>> index 0000000000000000000000000000000000000000..935eebbdc7f0e2aa07b0b6439ab53d1f50940929
>> --- /dev/null
>> +++ b/scripts/genfiles.awk
>> @@ -0,0 +1,77 @@
>> +# SPDX-License-Identifier: EUPL-1.2+
>> +# SPDX-FileCopyrightText: 2021-2024 Alyssa Ross <hi@alyssa.is>
> 
> I wouldn't include this one.
See above.

>> +# SPDX-FileCopyrightText: 2025 Demi Marie Obenour <demiobenour@gmail.com>
>> +BEGIN {
>> +	RS = "\n";
>> +	FS = "\t";
>> +	modes["120000"] = "symlink";
>> +	modes["100644"] = "regular";
>> +	modes["100755"] = "regular";
>> +}
>> +
>> +function fail(msg) {
>> +	exit_code = 1;
>> +	print msg > "/dev/stderr";
>> +	exit 1;
>> +}
>> +
>> +# Extract data from built-in variables.
>> +{
>> +	filename = $2;
>> +	raw_mode = $1;
>> +	# awk autocreates empty string entries if the key is invalid,
>> +	# but the code exits in this case so that is okay.
>> +	mode = modes[raw_mode];
>> +}
>> +
>> +filename !~ /^[[:alnum:]_./-]+$/ {
>> +	fail("filename '" filename "' has forbidden characters");
>> +}
>> +
>> +# Skip license files
>> +/\.license$/ { next }
>> +
>> +filename ~ /^image\/etc\/s6-rc\// {
>> +	if (mode != "regular") {
>> +		fail("s6-rc-compile input '" filename "' isn't a regular file");
>> +	}
>> +	rc_files[rc_count++] = filename;
>> +	next;
>> +}
>> +
>> +mode == "symlink" {
>> +	symlinks[symlink_count++] = filename;
>> +	next;
>> +}
>> +
>> +mode == "regular" {
>> +	files[file_count++] = filename;
>> +	next;
>> +}
>> +
>> +{ fail("File '" filename "' is not regular file or symlink (mode " raw_mode ")"); }
>> +
>> +END {
>> +	if (exit_code) {
>> +		exit exit_code;
>> +	}
> 
> I don't think this ever happens?
> 
>> +	printf ("# SPDX-License-Identifier: EUPL-1.2+\n" \
>> +"# SPDX-FileCopyrightText: 2021-2024 Alyssa Ross <hi@alyssa.is>\n" \
>> +"# Generated by scripts/genfile.sh.  Any changes will be overwritten.\n" \
>> +"FILES =");
>> +	for (array_index = 0; array_index < file_count; array_index += 1) {
>> +		printf " \\\n\t%s", files[array_index];
>> +	}
>> +	printf ("\n\n" \
>> +"# These are separate because they need to be included, but putting\n" \
>> +"# them as make dependencies would confuse make.\n" \
>> +"LINKS =");
>> +	for (array_index = 0; array_index < symlink_count; array_index += 1) {
>> +		printf " \\\n\t%s", symlinks[array_index];
>> +	}
>> +	printf "\n\nS6_RC_FILES =";
>> +	for (array_index = 0; array_index < rc_count; array_index += 1) {
>> +		printf " \\\n\t%s", rc_files[array_index];
>> +	}
>> +	print "";
>> +}


-- 
Sincerely,
Demi Marie Obenour (she/her/hers)

[-- Attachment #1.1.2: OpenPGP public key --]
[-- Type: application/pgp-keys, Size: 7253 bytes --]

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 833 bytes --]

  reply	other threads:[~2025-09-27  8:42 UTC|newest]

Thread overview: 51+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-09-04  1:56 [PATCH 0/4] Generate file lists from a script Demi Marie Obenour
2025-09-04  1:56 ` [PATCH 1/4] Move all files for the image into a subdirectory Demi Marie Obenour
2025-09-04  1:56 ` [PATCH 2/4] Generate makefile file lists from a script Demi Marie Obenour
2025-09-08  9:59   ` Alyssa Ross
2025-09-08 18:45     ` Demi Marie Obenour
2025-09-09 14:51       ` Alyssa Ross
2025-09-04  1:56 ` [PATCH 3/4] Common make rules for building erofs images Demi Marie Obenour
2025-09-08 10:01   ` Alyssa Ross
2025-09-08 18:53     ` Demi Marie Obenour
2025-09-09 14:56       ` Alyssa Ross
2025-09-04  1:56 ` [PATCH 4/4] Use /etc/s6-rc/compiled for compiled s6-rc directory Demi Marie Obenour
2025-09-10  5:29 ` [PATCH v2 0/3] Generate file lists from a script Demi Marie Obenour
2025-09-10  5:29   ` [PATCH v2 1/3] Move all files for the image into a subdirectory Demi Marie Obenour
2025-09-10 18:58     ` Alyssa Ross
2025-09-11 12:21       ` Demi Marie Obenour
2025-09-10  5:29   ` [PATCH v2 2/3] Generate makefile file lists from a script Demi Marie Obenour
2025-09-10  5:29   ` [PATCH v2 3/3] Common make rules for building erofs images Demi Marie Obenour
2025-09-11 12:47   ` [PATCH v3 0/4] Generate file lists from a script Demi Marie Obenour
2025-09-11 12:47     ` [PATCH v3 1/4] Do not ignore errors from tar Demi Marie Obenour
2025-09-17 11:48       ` Alyssa Ross
2025-09-18  2:45         ` Demi Marie Obenour
2025-09-19  7:46           ` Alyssa Ross
2025-09-30 12:59             ` Alyssa Ross
2025-09-19  7:55       ` Alyssa Ross
2025-09-19 19:03         ` Demi Marie Obenour
2025-09-11 12:47     ` [PATCH v3 2/4] Move all files for the image into a subdirectory Demi Marie Obenour
2025-09-17 12:30       ` Alyssa Ross
2025-09-17 12:39       ` Alyssa Ross
2025-09-17 13:03       ` Alyssa Ross
2025-09-11 12:47     ` [PATCH v3 3/4] Generate makefile file lists from a script Demi Marie Obenour
2025-09-11 12:47     ` [PATCH v3 4/4] Common make rules for building erofs images Demi Marie Obenour
2025-09-21  2:23   ` [PATCH v3] Generate file lists from a script Demi Marie Obenour
2025-09-21  8:47     ` Alyssa Ross
2025-09-21 16:51       ` Demi Marie Obenour
2025-09-21 17:07         ` Alyssa Ross
2025-09-21 17:24     ` [PATCH v4] " Demi Marie Obenour
2025-09-25 11:22       ` Alyssa Ross
2025-09-26 16:31       ` [PATCH v5] " Demi Marie Obenour
2025-09-27  8:19         ` Alyssa Ross
2025-09-27  8:42           ` Demi Marie Obenour [this message]
2025-09-27 16:22         ` [PATCH v6] " Demi Marie Obenour
2025-09-29  8:12           ` Alyssa Ross
2025-09-29 17:20             ` Demi Marie Obenour
2025-09-29 17:18           ` [PATCH v7] " Demi Marie Obenour
2025-10-01  9:20             ` Alyssa Ross
2025-10-01  9:24               ` Demi Marie Obenour
2025-10-01  9:35             ` Alyssa Ross
2025-10-01 18:30             ` [PATCH v8] " Demi Marie Obenour
2025-10-02  9:46               ` Alyssa Ross
2025-10-02 17:37               ` [PATCH v9] " Demi Marie Obenour
2025-10-03  9:04                 ` Alyssa Ross

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=d10cfc57-0b59-4ee7-9c6a-7b326adb764a@gmail.com \
    --to=demiobenour@gmail.com \
    --cc=devel@spectrum-os.org \
    --cc=hi@alyssa.is \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://spectrum-os.org/git/crosvm
	https://spectrum-os.org/git/doc
	https://spectrum-os.org/git/mktuntap
	https://spectrum-os.org/git/nixpkgs
	https://spectrum-os.org/git/spectrum
	https://spectrum-os.org/git/ucspi-vsock
	https://spectrum-os.org/git/www

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).