Opened Nov 05, 2019 by delanoe@anoe
Searx -> Web Crawler

Parameters of the function:

  • Query: e.g. bee* AND "collapse disorder"
  • Limit on the number of searx results: e.g. 100
  • Depth of links to follow from each result: data Depth = Depth {in :: Int, out :: Int}
    • Depth 0 0 -> get only the text of the link itself (as fetched from searx)
    • Depth 1 0 -> get the text of the links within the targeted site, to a depth of 1
    • Depth 0 1 -> get the text of the neighboring links of the targeted sites, to a depth of 1
  • Filter with Query: Bool (TODO later)
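
The parameters above could be sketched in Haskell roughly as follows. This is only an illustration under stated assumptions, not the project's implementation. Since `in` is a reserved word in Haskell, the fields are named `depthIn` and `depthOut` here. The names `CrawlParams`, `Links`, `crawlTargets` and `toyLinks` are hypothetical. The sketch also assumes that "in" means links staying on the same site, that "out" means links leaving it, that each hop consumes one unit of the matching budget, and that the searx result itself is always included. Fetching pages and calling searx are left out: the link structure is passed in as a plain function.

```haskell
import Data.List (nub)

type Url = String

-- The issue writes `Depth {in :: Int, out :: Int}`; `in` is a Haskell
-- keyword, so the fields are renamed in this sketch.
data Depth = Depth { depthIn :: Int, depthOut :: Int }
  deriving (Show, Eq)

-- Hypothetical record grouping the parameters listed in the issue.
data CrawlParams = CrawlParams
  { cpQuery           :: String  -- e.g. "bee* AND \"collapse disorder\""
  , cpLimit           :: Int     -- max number of searx results, e.g. 100
  , cpDepth           :: Depth
  , cpFilterWithQuery :: Bool    -- marked "TODO later" in the issue
  } deriving (Show, Eq)

-- Assumption: a page's links are split into same-site and other-site links.
data Links = Links { internalLinks :: [Url], externalLinks :: [Url] }

-- List every URL whose text should be fetched, starting from the searx
-- results. Each internal hop spends one unit of depthIn, each external
-- hop one unit of depthOut, so the recursion always terminates.
crawlTargets :: (Url -> Links) -> Depth -> [Url] -> [Url]
crawlTargets linksOf depth seeds = nub (concatMap (go depth) seeds)
  where
    go (Depth i o) url =
      url : concat [ concatMap (go (Depth (i - 1) o)) (internalLinks ls) | i > 0 ]
         ++ concat [ concatMap (go (Depth i (o - 1))) (externalLinks ls) | o > 0 ]
      where
        ls = linksOf url

-- Apply the result limit before crawling.
targetsFor :: (Url -> Links) -> CrawlParams -> [Url] -> [Url]
targetsFor linksOf params results =
  crawlTargets linksOf (cpDepth params) (take (cpLimit params) results)

-- Toy link graph, for illustration only.
toyLinks :: Url -> Links
toyLinks "a.org/"  = Links ["a.org/x"] ["b.org/"]
toyLinks "a.org/x" = Links ["a.org/y"] []
toyLinks _         = Links [] []
```

With this toy graph, `crawlTargets toyLinks (Depth 0 0) ["a.org/"]` yields only the seed, `Depth 1 0` adds `a.org/x`, and `Depth 0 1` adds `b.org/` instead. A real crawler would replace `toyLinks` with a function that downloads a page and extracts its links, which would have to run in `IO`.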
Due date: Nov 8, 2019
Reference: gargantext/crawlers/searx#2