Skip to content

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
    • Help
    • Submit feedback
    • Contribute to GitLab
  • Sign in
haskell-gargantext
haskell-gargantext
  • Project
    • Project
    • Details
    • Activity
    • Releases
    • Cycle Analytics
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
    • Charts
  • Issues 175
    • Issues 175
    • List
    • Board
    • Labels
    • Milestones
  • Merge Requests 9
    • Merge Requests 9
  • CI / CD
    • CI / CD
    • Pipelines
    • Jobs
    • Schedules
    • Charts
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Members
    • Members
  • Collapse sidebar
  • Activity
  • Graph
  • Charts
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
  • gargantext
  • haskell-gargantexthaskell-gargantext
  • Issues
  • #165

Closed
Open
Opened Jan 10, 2023 by david Chavalarias@davidchavalarias
  • Report abuse
  • New issue
Report abuse New issue

Remove Duplicated in Doc Table

In 0.0.6.8.6, document differing by formating (cap, extra spaces, comma, etc.) stay duplicated. The function that compare documents at import should make a tolower on all strings and remove space and punctuation before comparison. A python script from Quentin already exists for the clean-up : https://gitlab.iscpif.fr/davidchavalarias/gargantext_ressources/blob/master/script/removeDuplicates.py

Edited Jan 10, 2023 by david Chavalarias
Assignee
Assign to
None
Milestone
None
Assign milestone
Time tracking
None
Due date
None
1
Labels
docs table
Assign labels
  • View project labels
Reference: gargantext/haskell-gargantext#165