Dreadnaut support #651

pramothragavan · 2024-05-26T18:16:51Z

Broadly speaking, a dreadnaut file starts with "configuration" information about the graph, such as the number of vertices (denoted by 'n'), the start index for vertex numbering (denoted by '$') and whether or not a graph is a digraph (denoted by the presence of 'd'). The configuration section always ends with a 'g'. The rest of the file gives information concerning individual vertices in the form of adjacency lists. For example:

n=2
$=1
d
g
1: 1 2;
2: 2;

would represent a 1-indexed digraph with 2 vertices with edges {1,1}, {1,2}, {2,2}.

General overview:

Decoder:

DIGRAPHS_ParseDreadnautConfig aims to get values for either '$' (which indicates the start index for vertex numbering) or 'n' (which indicates the number of vertices). Note that '$' defaults to 0 and that I chose to reindex all graphs such that vertex numbering starts at one (which I think is convention for the Digraphs package?)
DIGRAPHS_LegalDreadnautEdge aims to filter out illegal edges and throws an error if an edge is illegal. An example of an illegal edge might be a loop for an undirected graph or an edge containing a vertex that is not allowed within the constraints of the values of '$' and 'n'. (In the case of illegal edges, nauty throws a warning message and then ignores the edge so I was trying to replicate this behaviour).
DIGRAPHS_SplitDreadnautLines effectively takes a line of dreadnaut (e.g. "1: 2 3 5; 4: 2 1 3; 2: 3;") and aims to split this into parts which are to be handled individually (in this case the parts would be ["1: 2 3 5;", "4: 2 1 3;", "2: 3;"]). The idea here is that although usually these parts would each be on their own line, it's techincally fine for some or all of them to share a line (with or without a semicolon) so I thought it made more sense to condense everything onto one line and then split into parts. There are various auxiliary commands that can be used within the dreadnaut format alongside the definition of the graph (more info here) which I mostly chose to neglect, with the exception of 'f' which defines a partition of vertices. Note that '$$' at the end of a file means reindex the graph to start counting at 0 (which I ignored).
DIGRAPHS_ParseDreadnautGraph intends to parse the non-configuration part of the file, which has been split into parts after being fed through to DIGRAPHS_SplitDreadnautLines

These are all combined in ReadDreadnautGraph.

Encoder:
WriteDreadnautGraph takes a digraph and encodes into dreadnaut format.

I'm in the process of writing documentation!

james-d-mitchell

Generally speaking this looks really good! I've added a few comments, mostly about the adding some details to the error messages if possible. Some general comments:

it'd be great if the error messages could report in what line of the file the error occurs, I think we discussed this, but don't exactly remember the outcome of this. One approach would be to store the original file contents in a variable, and then search within that for the part that causes the error. I'm not sure if this would actually work or not, just a thought.
in the PR description you mention:

Note that '$' defaults to 0 and that I chose to reindex all graphs such that vertex numbering starts at one (which I think is convention for the Digraphs package?)

this sounds appropriate, and yes this is the convention (really more a requirement in Digraphs, i.e. at present it's only possible to have digraphs with vertices [1 .. n] for some n). It'd be best if the code at the very least issued a warning when you are renumber the vertices, to avoid violating the principal of least astonishment (i.e. try to read a graph with nodes not [1 .. n], then silently getting a graph with nodes [1 .. n] would be surprising, so better issue a warning that this is happening). It'd also be useful to have the original vertices as labels in the newly constructed digraph, so if for example the graph has 0-indexed vertices, then the labels of the vertices in the output graph would be set using DigraphSetVertexLabels(D, [0 .. n - 1]); (if this is the correct mapping).

You mention in a couple of other places in the PR description that your code potentially ignores some other parts of dreadnaut files, if you detect parts that are ignored for whatever reason then, please issue a warning for each of these too, again to avoid surprising the user.
Have you checked how good the code coverage of your tests is? Given the number of lines of code in the implementation versus the number of lines of tests, I'm guessing that there's maybe some work to do there. I've sent you a python script by email that you can use to check the code coverage, just run ./code-coverage-test-gap.py tst/standard/io.tst inside the digraphs directory.

gap/io.gi

james-d-mitchell · 2024-05-30T08:43:33Z

@pramothragavan please let me know when you think this is ready again, and thanks !

pramothragavan · 2024-05-30T09:37:20Z

@pramothragavan please let me know when you think this is ready again, and thanks !

Will do!

…ore places where i could add in line on which error occurs

…m 1 2 : 3 4

mtorpey · 2025-01-28T15:15:03Z

Hi @pramothragavan! Looking forward to hopefully seeing you soon for the new VIP.

What state did this Dreadnaut project get to? Would it be a good thing for you to get back into this semester if there's work still to do on it?

pramothragavan · 2025-01-28T16:22:16Z

Hi — there are definitely some kinks to be dealt with, but I think it would be a good place to start!

…

On 28 Jan 2025, at 15:15, Michael Young ***@***.***> wrote: Hi @pramothragavan <https://github.com/pramothragavan>! Looking forward to hopefully seeing you soon for the new VIP. What state did this Dreadnaut project get to? Would it be a good thing for you to get back into this semester if there's work still to do on it? — Reply to this email directly, view it on GitHub <#651 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AZXCLQY4AOD2PKUEJQWZQVT2M6NI5AVCNFSM6AAAAABIJ72E4SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMMJZGI4TGNRWGU>. You are receiving this because you were mentioned.

… in memory

avoid "while true"

pramothragavan · 2025-02-08T18:41:38Z

This is a significant overhaul on previous versions -- WriteDreadnautGraph is untouched, but the decoder has been completely rewritten.

As @james-d-mitchell suggested, I've taken the parser used in the dreadnaut program and effectively rewritten it in GAP. The original C code uses a stream to parse character by character. GAP has a Stream object, but this lacks some of the functionality needed, so I created a record called Stream that aligns GAP's streams with how they're used in C. Other helper functions I've added:

DIGRAPHS_GETNWC finds the next character in the stream that is not in " ,\t"
DIGRAPHS_GETNWL finds the next character in the stream that is not in " \n\t\r"
DIGRAPHS_readinteger reads integers from the stream (i.e. avoiding issues with reading "10" as opposed to "1" and "0" that might arise when parsing character by character)
DIGRAPHS_GetInt also reads the next integer from the stream. There are some instances where dreadnaut allows for an optional '=' character (e.g. n=2 is the same as n2). This function ignores any '=' characters and then calls DIGRAPHS_readinteger.
DIGRAPHS_readgraph parses the graph's adjacency data
DIGRAPHS_ParsePartition is used to parse a partition, if given. The partition is stored using vertex labels.

Documentation for various commands is given here (pages 6-12). Many of these are used to manipulate the graph and I have focused on supporting commands more closely tied to directly defining the graph.

For now, I need to write (many) tests but I'm also interested if there are any commands that you'd like to see support for. I'm happy to implement anything really, but didn't want to waste time on things you didn't want. The commands that I am currently supporting are:

All of those mentioned in section (A) of the above link. In dreadnaut, these would just define the mode which dreadnaut is using. This is important for subsequent use of nauty/traces, but is irrelevant for actually reading in the graph so this is just ignored.
From section (B): n=#, g (and all subcommands), _, __
From section (C): f
From section (D): $=#, $$, +, d, -d
From section (F): "...", !, q

I think a couple of the unsupported commands from (B) might be worth looking into. Anything unsupported currently should raise an InfoWarning, with the exception of <, >, e (these three relate to reading in, outputting and editing graphs) which raise ErrorNoReturn.

pramothragavan added 30 commits March 16, 2024 11:17

Encoder function w/out tests

2dea414

WriteDreadnautGraph bug fix and prelim tests

a11aa50

ReadDreadnaut with tests passed

3e1087f

mend

b533858

ReadDreadnaut (untested)

8ef4d97

working functions, tests needed

781520b

more edge cases covered

0502023

more tests (passed)

c0417d7

dealing with whitespace, nodes in wrong order, dealing with .

23e4ab1

reading edges continued onto next line

51d5285

overline bug fix

eeba1db

. in its own line

a74be35

$$ and q support, additional tests

e2df61c

q functionality and legaledges

338c88a

syntax

45c3b44

graphdata sharing lines with config, increased readability

f20d7ee

multiple vertices on one line functionality

5366767

addded tests

e762c36

corrected splitline function

53572d3

r.dollarValue <> 1 support

24a8ef7

improved write function

33946fe

support for multiple $/n declarations

9115086

minor fixes

014c19f

condense onto one line, throw error instead of ignoring

df6a3b1

updated test outputs

9fd642c

cleanup

8094163

error messages

ad734fe

more error handling

fb1cab1

minor changes

52d7841

partition support

a65158b

pramothragavan added 3 commits May 27, 2024 22:13

gaplint

d49bbdc

fix .q support

16ead34

minor fixes

dfd4488

james-d-mitchell requested changes May 29, 2024

View reviewed changes

james-d-mitchell added new-feature A label for new features. waiting for creator input A label for issues/PRs where we are waiting for the creator to do something labels May 29, 2024

james-d-mitchell reviewed May 29, 2024

View reviewed changes

gap/io.gi Outdated Show resolved Hide resolved

pramothragavan added 4 commits May 29, 2024 19:21

minor changes: errors -> infowarnings, more informative warnings

e01b2aa

updated tests

e86bb9a

support f = #

b26d7e7

remove redundant argument

656a6bd

pramothragavan added 3 commits October 23, 2024 19:32

this works! need to make more robust + clean and there are probably m…

353c4ca

…ore places where i could add in line on which error occurs

passing tst, added support for multiple vertex declaration of the for…

88c6167

…m 1 2 : 3 4

support for lines w/out colons

ec209e9

pramothragavan added 11 commits February 8, 2025 03:35

revamp! taken from c

237f041

glint

66ddf5b

codespell

4c0f0da

updated tests

04c0c5d

close streams in error handling

77683e4

more warnings, improved c=# scenarios

2dae3f2

realised that sparse mode is just the same format, stored differently…

bb670fe

… in memory

_ support

61551b8

avoid "while true"

Improve various error messages to include line numbers

3ca9111

ignore changing between modes

e8ec6a4

fix reading "..." with \" in comment

06b9771

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dreadnaut support #651

Dreadnaut support #651

pramothragavan commented May 26, 2024 •

edited

Loading

james-d-mitchell left a comment

james-d-mitchell commented May 30, 2024

pramothragavan commented May 30, 2024

mtorpey commented Jan 28, 2025

pramothragavan commented Jan 28, 2025 via email

pramothragavan commented Feb 8, 2025

Dreadnaut support #651

Are you sure you want to change the base?

Dreadnaut support #651

Conversation

pramothragavan commented May 26, 2024 • edited Loading

james-d-mitchell left a comment

Choose a reason for hiding this comment

james-d-mitchell commented May 30, 2024

pramothragavan commented May 30, 2024

mtorpey commented Jan 28, 2025

pramothragavan commented Jan 28, 2025 via email

pramothragavan commented Feb 8, 2025

pramothragavan commented May 26, 2024 •

edited

Loading