fix(find.py): Correct span calculation for short-form citations #205

flooie · 2025-02-06T20:31:19Z

Short-form citations were incorrectly identifying the span and full span of a citation.

For example:

And Twombly, 550 U. S., at 555 …

Currently, when an antecedent guess is identified, it is not factored into the full span calculation. Additionally, the pin-cite is not correctly incorporated into the offset. This fix ensures both are properly accounted for.

This PR is meant to help address an issue discovered in identifying reference citations. The issue here is that
short case citations routinely develop an issue resolving the antecedent guess as a reference citation.

Short-form citations were incorrectly identifying the span and full span of a citation. For example: And Twombly, 550 U. S., at 555 … Currently, when an antecedent guess is identified, it is not factored into the full span calculation. Additionally, the pin-cite is not correctly incorporated into the offset. This fix ensures both are properly accounted for.

grossir

Seems like the "span_end" is not correct? Check output 17

text = "blah blah blah Adarand, 515 U.S., at 241 qdqwd lorem ipsum"

short = get_citations(text)[0]

In [15]: short
Out[15]: ShortCaseCitation('515 U.S., at 241', groups={'volume': '515', 'reporter': 'U.S.', 'page': '241'}, metadata=ShortCaseCitation.Metadata(parenthetical=None, pin_cite=None, year=None, court='scotus', antecedent_guess='Adarand'))

In [16]: text[short.full_span()[0]:short.full_span()[1]]
Out[16]: 'Adarand, 515 U.S., at 241'


# this last one is not working properly
In [17]: text[short.span()[0]:short.span()[1]]
Out[17]: '515 U.S., at '

flooie · 2025-02-06T21:27:01Z

@grossir that is the current functionality I believe. I didnt want to change that in this PR b

github-actions · 2025-02-07T13:59:16Z

The Eyecite Report 👁️

Gains and Losses

There were 0 gains and 0 losses.

Click here to see details.

id	Gain	Loss

Time Chart

Generated Files

Branch 1 Output
Branch 2 Output
Full Output CSV

flooie assigned flooie and grossir Feb 6, 2025

fix(lint): Fix lint issue

e0cff41

flooie assigned grossir and unassigned flooie and grossir Feb 6, 2025

grossir reviewed Feb 6, 2025

View reviewed changes

grossir assigned flooie and unassigned grossir Feb 6, 2025

flooie assigned grossir and unassigned flooie Feb 6, 2025

Merge branch 'main' into update-short-case-full-span

8aa0ffa

flooie merged commit 42dd315 into main Feb 7, 2025
13 checks passed

flooie deleted the update-short-case-full-span branch February 7, 2025 15:01

grossir mentioned this pull request Feb 11, 2025

Update Citation model's full span and regexes to account for ReferenceCitation overlaps #209

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(find.py): Correct span calculation for short-form citations #205

fix(find.py): Correct span calculation for short-form citations #205

flooie commented Feb 6, 2025

grossir left a comment

flooie commented Feb 6, 2025

github-actions bot commented Feb 7, 2025

fix(find.py): Correct span calculation for short-form citations #205

fix(find.py): Correct span calculation for short-form citations #205

Conversation

flooie commented Feb 6, 2025

grossir left a comment

Choose a reason for hiding this comment

flooie commented Feb 6, 2025

github-actions bot commented Feb 7, 2025

The Eyecite Report 👁️

Gains and Losses

Time Chart

Generated Files