Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
haskell-gargantext
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
153
Issues
153
List
Board
Labels
Milestones
Merge Requests
12
Merge Requests
12
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
gargantext
haskell-gargantext
Commits
75b2ee63
Commit
75b2ee63
authored
2 years ago
by
Przemyslaw Kaminski
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
[nodeStory] notes about implementation
Also, fix bug with reversing the history.
parent
6421aac1
Pipeline
#3043
failed with stage
in 58 minutes and 3 seconds
Changes
1
Pipelines
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
41 additions
and
20 deletions
+41
-20
NodeStory.hs
src/Gargantext/Core/NodeStory.hs
+41
-20
No files found.
src/Gargantext/Core/NodeStory.hs
View file @
75b2ee63
...
@@ -10,6 +10,30 @@ Portability : POSIX
...
@@ -10,6 +10,30 @@ Portability : POSIX
A Node Story is a Map between NodeId and an Archive (with state,
A Node Story is a Map between NodeId and an Archive (with state,
version and history) for that node.
version and history) for that node.
Couple of words on how this is implemented.
First version used files which stored Archive for each NodeId in a
separate .cbor file.
For performance reasons, it is rewritten to use the DB.
The table `node_stories` contains two columns: `node_id` and
`archive`.
Next, it was observed that `a_history` in `Archive` takes much
space. So a new table was created, `node_story_archive_history` with
columns: `node_id`, `ngrams_type_id`, `patch`. This is because each
history item is in fact a map from `NgramsType` to `NgramsTablePatch`
(see the `NgramsStatePatch'` type).
Moreover, since in ~G.A.Ngrams.commitStatePatch~ we use current state
only, with only recent history items, I concluded that it is not
necessary to load whole history into memory. Instead, it is kept in DB
(history is immutable) and only recent changes are added to
`a_history`. Then that record is cleared whenever `Archive` is saved.
Please note that
TODO:
TODO:
- remove
- remove
- filter
- filter
...
@@ -54,10 +78,10 @@ module Gargantext.Core.NodeStory
...
@@ -54,10 +78,10 @@ module Gargantext.Core.NodeStory
where
where
-- import Debug.Trace (traceShow)
-- import Debug.Trace (traceShow)
--
import Control.Debounce (mkDebounce, defaultDebounceSettings, debounceFreq, debounceAction)
import
Control.Debounce
(
mkDebounce
,
defaultDebounceSettings
,
debounceFreq
,
debounceAction
)
import
Codec.Serialise.Class
import
Codec.Serialise.Class
import
Control.Arrow
(
returnA
)
import
Control.Arrow
(
returnA
)
import
Control.Concurrent
(
MVar
(),
{-withMVar,-}
newMVar
,
modifyMVar_
)
import
Control.Concurrent
(
MVar
(),
withMVar
,
newMVar
,
modifyMVar_
)
import
Control.Exception
(
catch
,
throw
,
SomeException
(
..
))
import
Control.Exception
(
catch
,
throw
,
SomeException
(
..
))
import
Control.Lens
(
makeLenses
,
Getter
,
(
^.
),
(
.~
),
traverse
)
import
Control.Lens
(
makeLenses
,
Getter
,
(
^.
),
(
.~
),
traverse
)
import
Control.Monad.Except
import
Control.Monad.Except
...
@@ -171,7 +195,6 @@ instance (Semigroup s, Semigroup p) => Semigroup (Archive s p) where
...
@@ -171,7 +195,6 @@ instance (Semigroup s, Semigroup p) => Semigroup (Archive s p) where
,
_a_state
=
s'
,
_a_state
=
s'
,
_a_history
=
p'
<>
p
}
,
_a_history
=
p'
<>
p
}
-- instance Monoid (Archive NgramsState' NgramsStatePatch') where
instance
(
Monoid
s
,
Semigroup
p
)
=>
Monoid
(
Archive
s
p
)
where
instance
(
Monoid
s
,
Semigroup
p
)
=>
Monoid
(
Archive
s
p
)
where
mempty
=
Archive
{
_a_version
=
0
mempty
=
Archive
{
_a_version
=
0
,
_a_state
=
mempty
,
_a_state
=
mempty
...
@@ -307,7 +330,9 @@ getNodeStory pool (NodeId nodeId) = do
...
@@ -307,7 +330,9 @@ getNodeStory pool (NodeId nodeId) = do
insertNodeArchive
::
Pool
PGS
.
Connection
->
NodeId
->
ArchiveQ
->
IO
Int64
insertNodeArchive
::
Pool
PGS
.
Connection
->
NodeId
->
ArchiveQ
->
IO
Int64
insertNodeArchive
pool
nodeId
@
(
NodeId
nId
)
(
Archive
{
..
})
=
do
insertNodeArchive
pool
nodeId
@
(
NodeId
nId
)
(
Archive
{
..
})
=
do
ret
<-
withResource
pool
$
\
c
->
runInsert
c
insert
ret
<-
withResource
pool
$
\
c
->
runInsert
c
insert
insertNodeArchiveHistory
pool
nodeId
_a_history
-- NOTE: It is assumed that the most recent change is the first in the
-- list, so we save these in reverse order
insertNodeArchiveHistory
pool
nodeId
$
reverse
_a_history
pure
ret
pure
ret
where
where
emptyHistory
=
[]
::
[
NgramsStatePatch'
]
emptyHistory
=
[]
::
[
NgramsStatePatch'
]
...
@@ -321,7 +346,9 @@ insertNodeArchive pool nodeId@(NodeId nId) (Archive {..}) = do
...
@@ -321,7 +346,9 @@ insertNodeArchive pool nodeId@(NodeId nId) (Archive {..}) = do
updateNodeArchive
::
Pool
PGS
.
Connection
->
NodeId
->
ArchiveQ
->
IO
Int64
updateNodeArchive
::
Pool
PGS
.
Connection
->
NodeId
->
ArchiveQ
->
IO
Int64
updateNodeArchive
pool
nodeId
@
(
NodeId
nId
)
(
Archive
{
..
})
=
do
updateNodeArchive
pool
nodeId
@
(
NodeId
nId
)
(
Archive
{
..
})
=
do
ret
<-
withResource
pool
$
\
c
->
runUpdate
c
update
ret
<-
withResource
pool
$
\
c
->
runUpdate
c
update
insertNodeArchiveHistory
pool
nodeId
_a_history
-- NOTE: It is assumed that the most recent change is the first in the
-- list, so we save these in reverse order
insertNodeArchiveHistory
pool
nodeId
$
reverse
_a_history
pure
ret
pure
ret
where
where
emptyHistory
=
[]
::
[
NgramsStatePatch'
]
emptyHistory
=
[]
::
[
NgramsStatePatch'
]
...
@@ -383,13 +410,13 @@ nodeStoryIncs pool Nothing (ni:ns) = do
...
@@ -383,13 +410,13 @@ nodeStoryIncs pool Nothing (ni:ns) = do
readNodeStoryEnv
::
Pool
PGS
.
Connection
->
IO
NodeStoryEnv
readNodeStoryEnv
::
Pool
PGS
.
Connection
->
IO
NodeStoryEnv
readNodeStoryEnv
pool
=
do
readNodeStoryEnv
pool
=
do
mvar
<-
nodeStoryVar
pool
Nothing
[]
mvar
<-
nodeStoryVar
pool
Nothing
[]
--
saver <- mkNodeStorySaver pool mvar
saver
<-
mkNodeStorySaver
pool
mvar
let
saver
=
modifyMVar_
mvar
$
\
mv
->
do
--
let saver = modifyMVar_ mvar $ \mv -> do
writeNodeStories
pool
mv
--
writeNodeStories pool mv
printDebug
"[readNodeStoryEnv] saver"
mv
--
printDebug "[readNodeStoryEnv] saver" mv
let
mv'
=
clearHistory
mv
--
let mv' = clearHistory mv
printDebug
"[readNodeStoryEnv] saver, cleared"
mv'
--
printDebug "[readNodeStoryEnv] saver, cleared" mv'
return
mv'
--
return mv'
pure
$
NodeStoryEnv
{
_nse_var
=
mvar
pure
$
NodeStoryEnv
{
_nse_var
=
mvar
,
_nse_saver
=
saver
,
_nse_saver
=
saver
,
_nse_getter
=
nodeStoryVar
pool
(
Just
mvar
)
}
,
_nse_getter
=
nodeStoryVar
pool
(
Just
mvar
)
}
...
@@ -402,10 +429,8 @@ nodeStoryVar pool (Just mv) nIds = do
...
@@ -402,10 +429,8 @@ nodeStoryVar pool (Just mv) nIds = do
_
<-
modifyMVar_
mv
$
\
nsl
->
(
nodeStoryIncs
pool
(
Just
nsl
)
nIds
)
_
<-
modifyMVar_
mv
$
\
nsl
->
(
nodeStoryIncs
pool
(
Just
nsl
)
nIds
)
pure
mv
pure
mv
-- TODO No debounce since this is IO stuff.
-- Debounce is useful since it could delay the saving to some later
-- debounce is useful since it could delay the saving to some later
-- time, asynchronously and we keep operating on memory only.
-- time, asynchronously and we keep operating on memory only.
{-
mkNodeStorySaver
::
Pool
PGS
.
Connection
->
MVar
NodeListStory
->
IO
(
IO
()
)
mkNodeStorySaver
::
Pool
PGS
.
Connection
->
MVar
NodeListStory
->
IO
(
IO
()
)
mkNodeStorySaver
pool
mvns
=
mkDebounce
settings
mkNodeStorySaver
pool
mvns
=
mkDebounce
settings
where
where
...
@@ -413,17 +438,13 @@ mkNodeStorySaver pool mvns = mkDebounce settings
...
@@ -413,17 +438,13 @@ mkNodeStorySaver pool mvns = mkDebounce settings
{
debounceAction
=
do
{
debounceAction
=
do
withMVar
mvns
(
\
ns
->
writeNodeStories
pool
ns
)
withMVar
mvns
(
\
ns
->
writeNodeStories
pool
ns
)
withMVar
mvns
(
\
ns
->
printDebug
"[mkNodeStorySaver] debounce nodestory"
ns
)
withMVar
mvns
(
\
ns
->
printDebug
"[mkNodeStorySaver] debounce nodestory"
ns
)
modifyMVar_ mvns $ \ns -> pure $ clear
AHistoryToInsert
ns
modifyMVar_
mvns
$
\
ns
->
pure
$
clear
History
ns
,
debounceFreq
=
1
*
minute
,
debounceFreq
=
1
*
minute
}
}
minute
=
60
*
second
minute
=
60
*
second
second
=
10
^
(
6
::
Int
)
second
=
10
^
(
6
::
Int
)
-}
clearHistory
::
NodeListStory
->
NodeListStory
clearHistory
::
NodeListStory
->
NodeListStory
-- clearHistory (NodeStory ns) =
-- NodeStory $ Map.map (\(Archive { .. }) -> Archive { _a_history_to_insert = emptyHistory, .. }) ns
clearHistory
(
NodeStory
ns
)
=
NodeStory
$
ns
&
(
traverse
.
a_history
)
.~
emptyHistory
clearHistory
(
NodeStory
ns
)
=
NodeStory
$
ns
&
(
traverse
.
a_history
)
.~
emptyHistory
where
where
emptyHistory
=
[]
::
[
NgramsStatePatch'
]
emptyHistory
=
[]
::
[
NgramsStatePatch'
]
...
...
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment