Przemyslaw Kaminski / haskell-gargantext
Commit 01be6e4a, authored Mar 23, 2022 by Alexandre Delanoë
[FEAT] Lang NLP
Parent: 5ed03e06
Showing 2 changed files with 3 additions and 2 deletions
src/Gargantext/API/Node/Corpus/New.hs (+2, -0)
src/Gargantext/Core/Text/Terms/Multi/PosTagging.hs (+1, -2)
src/Gargantext/API/Node/Corpus/New.hs
@@ -310,6 +310,8 @@ addToCorpusWithForm user cid (NewWithForm ft ff d l _n) logStatus jobLog = do
  --printDebug "Starting extraction : " cid
  -- TODO granularity of the logStatus
  printDebug "flowCorpus with lang" l
  _cid' <- flowCorpus user (Right [cid]) (Multi $ fromMaybe EN l)
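The `fromMaybe EN l` expression in this hunk falls back to English when the form carries no language. A minimal, self-contained sketch of that fallback follows; the two-constructor `Lang` type and the `resolveLang` helper are simplified, hypothetical stand-ins for illustration and are not taken from the Gargantext code base.

```haskell
module Main where

import Data.Maybe (fromMaybe)

-- Simplified stand-in for the project's language type (assumption:
-- the real type is defined elsewhere and has more constructors).
data Lang = EN | FR
  deriving (Eq, Show)

-- Hypothetical helper: use the requested language if present,
-- otherwise default to EN, mirroring `fromMaybe EN l` in the diff.
resolveLang :: Maybe Lang -> Lang
resolveLang = fromMaybe EN

main :: IO ()
main = do
  print (resolveLang Nothing)    -- prints EN
  print (resolveLang (Just FR))  -- prints FR
```

With this shape, a missing language never reaches the extraction step as `Nothing`; the debug line added in the commit logs the raw `Maybe` value before the default is applied.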
src/Gargantext/Core/Text/Terms/Multi/PosTagging.hs
@@ -41,7 +41,7 @@ tokens2tokensTags :: [Token] -> [TokenTag]
 tokens2tokensTags ts = filter' $ map tokenTag ts
 ------------------------------------------------------------------------
 tokenTag :: Token -> TokenTag
-tokenTag (Token _ _ w l _ _ p n _ _) = TokenTag w' l' p n
+tokenTag (Token _ w _ l _ _ p n _ _) = TokenTag w' l' p n
   where
     w' = split w
     l' = fromList (split l)
@@ -53,7 +53,6 @@ filter' xs = filter isNgrams xs
    isNgrams (TokenTag _ _ p n) = isJust p || isJust n
------------------------------------------------------------------------
-- request =
-- "fr" : {
-- "tokenize.language" : "fr",