Make use of recipe-scrapers for more generic web imports #18

cydanil · 2021-08-20T13:44:08Z

https://github.com/hhursev/recipe-scrapers provides a nice interface to scrape a number of websites, which could replace our importers for more flexible imports.

This would also remove the burden for us to maintain importers on top of the application itself, as well as support way more websites than we could.

cydanil · 2021-09-02T11:19:08Z

I have a branch that does this.

The import dialog was reworked to be a single dialog for file and web imports, with validation of the supported urls:

File import behaves the same.

Web recipes are imported using recipe-scrapers, eg https://www.allrecipes.com/recipe/257887/lunch-biscuits/ :

However, there's still a couple of items needing improvements:

Separate the web_importer from Gourmand internals;
Create web import status;
Set the cooking and preparation times.

The motivation for 1 is to be able to plug in more web importers in a homogeneous should the need arise, and to decouple the import mechanism from user feedback.

web imports still take a few seconds, mostly related to retrieving the recipe image. As such, it would be beneficial to have the import status in an info bar, as file imports do it.

cydanil · 2021-09-04T10:57:25Z

To finalize this feature, an import status still needs to be implemented: recipes take a while to import, mostly due to retrieving and scaling images.

eliotb · 2021-09-11T22:44:59Z

Does this change mean that it will only be possible to import from one of the supported websites, rather than from any webpage with a recipe on it? I notice that the recipe scraper library has a sort of wild card option that may work on sites that are not directly supported
If so, workaround is to save page as a file, or copy and paste text into plain text file, save that and then import it.
Related - to shortcut the copy,paste, save, import sequence would you be open to supporting direct import from the clipboard?

cydanil · 2021-10-09T15:51:18Z

I would have preferred to support these websites indeed, but it's not realistic.
Their wildcard option still expects the recipes to be formatted in a specific way (json-ld).

I would prioritise recipe-scrapers.

I like your suggesting of importing recipes by pasting selected text (or recipe file) in a window (the main window?). I will prototype something and ask for your feedback :)

founderio · 2021-10-09T15:54:33Z

@cydanil Someone made a wrapper to text parsing here: hhursev/recipe-scrapers#9 (comment)

Maybe that can be utilized?

cydanil · 2021-10-09T16:30:09Z

Thanks for pointing me towards this comment, @founderio. However, it seems to still expect SchemaOrg content.
I tried to use it with a recipe using basic html (which I think will be the #1 need) to no avail.

I picture the imports to be unstructured text as some variation of what's below:

text = """
<html>
<title>Recipe title </title>
<br/>
Ingredients:

Ingredient Group 1:

    1 ⅓ cups all-purpose flour
    1 tablespoon granulated sugar
    ½ teaspoon salt
    ½ cup shortening
    3 ½ tablespoons cold water

Ingredient Group 2:

    2 cups mashed, cooked pumpkin, or about 1 1/2 pounds skin-on, raw pumpkin; Shortcut: You can substitute with canned pumpkin. Before you buy, you should check the label to make sure of what's in the can; if it's labeled "pumpkin pie filling" it's already spiced. If you plan to sweeten yourself with the ingredients below, go for unflavored canned pumpkin.
    1 (12 fluid ounce) can evaporated milk
    2 eggs, beaten
    ¾ cup packed brown sugar
    ½ teaspoon ground cinnamon
    ½ teaspoon ground ginger
    ½ teaspoon ground nutmeg
    ½ teaspoon salt
<br/>
whole pumpkin for pie

1. Instructions Part 1
blabla

2. Instructions Part 2
text

3. Instruction Part 3
text
</html>
"""

eliotb · 2021-10-09T23:00:15Z

I like your suggesting of importing recipes by pasting selected text (or recipe file) in a window (the main window?).

I have just realised that the traditional importer (i.e. with the text pane and all the tag buttons on the right) almost does this now.
The text pane is already editable, and supports paste. So all that is needed is to be able to open it with the text pane empty. Then the recipe can be pasted in there, edited, tagged and imported.
I tested this by "importing" a local text file, and then deleting the content of the text pane.

cydanil · 2021-10-28T13:23:26Z

I've been experimenting with drag and drop and pasting.
It resulted in the following (pasting not demonstrated):

What do you think?

eliotb · 2021-10-28T21:30:52Z

This looks great!
I'd be really happy with the second part where you drop selected text and get the import editor.

cydanil · 2021-11-10T15:41:14Z

This has been finalized in #67, where there's a couple of screencasts demonstrating drag and drop, similar to copy/paste.

I appreciate that supporting only a set of website looks like a regression in terms of functionality, even though it's compensated by plain text import.
I will see how easily can I make any url importable, as it used to be.

Thanks for your feedback and ideas!

cydanil mentioned this issue Sep 3, 2021

Make use of recipe-scrapers for web imports #27

Merged

6 tasks

newca12 mentioned this issue Oct 29, 2021

Gourmand crashes when trying to change fields while importing. #60

Closed

cydanil closed this as completed Nov 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make use of recipe-scrapers for more generic web imports #18

Make use of recipe-scrapers for more generic web imports #18

cydanil commented Aug 20, 2021

cydanil commented Sep 2, 2021 •

edited

Loading

cydanil commented Sep 4, 2021

eliotb commented Sep 11, 2021 •

edited

Loading

cydanil commented Oct 9, 2021

founderio commented Oct 9, 2021

cydanil commented Oct 9, 2021

eliotb commented Oct 9, 2021

cydanil commented Oct 28, 2021

eliotb commented Oct 28, 2021

cydanil commented Nov 10, 2021

Make use of recipe-scrapers for more generic web imports #18

Make use of recipe-scrapers for more generic web imports #18

Comments

cydanil commented Aug 20, 2021

cydanil commented Sep 2, 2021 • edited Loading

cydanil commented Sep 4, 2021

eliotb commented Sep 11, 2021 • edited Loading

cydanil commented Oct 9, 2021

founderio commented Oct 9, 2021

cydanil commented Oct 9, 2021

eliotb commented Oct 9, 2021

cydanil commented Oct 28, 2021

eliotb commented Oct 28, 2021

cydanil commented Nov 10, 2021

cydanil commented Sep 2, 2021 •

edited

Loading

eliotb commented Sep 11, 2021 •

edited

Loading