Commit 3b672039, authored Jan 06, 2015 by Adam Tauber
Merge pull request #167 from Cqoicebordel/minor-fixes
Minor fixes
Parents: 7b531c6f, 3aa3a463
Showing 2 changed files with 10 additions and 15 deletions
searx/engines/digg.py: +3 -0
searx/engines/vimeo.py: +7 -15
searx/engines/digg.py
@@ -44,6 +44,9 @@ def response(resp):

     search_result = loads(resp.text)

+    if search_result['html'] == '':
+        return results
+
     dom = html.fromstring(search_result['html'])

     # parse results
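The guard added in this hunk can be sketched in isolation as follows. This is a minimal illustration, not the engine's actual code: the function name `parse_digg_payload` and the `raw_html` placeholder entry are hypothetical, and the real DOM parsing step is elided in the diff, so it is replaced here by a stand-in.

```python
from json import loads


def parse_digg_payload(text):
    """Return an empty result list early when the JSON payload has no HTML."""
    results = []
    search_result = loads(text)

    # Guard from the diff: an empty 'html' field means there is nothing
    # to hand to the HTML parser, so return the (empty) results as-is.
    if search_result['html'] == '':
        return results

    # Hypothetical stand-in for the real parsing step (elided in the diff).
    results.append({'raw_html': search_result['html']})
    return results


print(parse_digg_payload('{"html": ""}'))
```

Returning the still-empty `results` list keeps the return type the same on both paths, so callers do not need a special case for "no results".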
searx/engines/vimeo.py
@@ -13,24 +13,23 @@
 # @todo set content-parameter with correct data

 from urllib import urlencode
-from HTMLParser import HTMLParser
 from lxml import html
-from searx.engines.xpath import extract_text
 from dateutil import parser
+from cgi import escape

 # engine dependent config
 categories = ['videos']
 paging = True

 # search-url
-base_url = 'https://vimeo.com'
+base_url = 'http://vimeo.com'
 search_url = base_url + '/search/page:{pageno}?{query}'

 # specific xpath variables
-results_xpath = '//div[@id="browse_content"]/ol/li'
 url_xpath = './a/@href'
-title_xpath = './a/div[@class="data"]/p[@class="title"]'
 content_xpath = './a/img/@src'
+title_xpath = './a/div[@class="data"]/p[@class="title"]/text()'
+results_xpath = '//div[@id="browse_content"]/ol/li'
 publishedDate_xpath = './/p[@class="meta"]//attribute::datetime'
@@ -39,10 +38,6 @@ def request(query, params):
     params['url'] = search_url.format(pageno=params['pageno'], query=urlencode({'q': query}))

-    # TODO required?
-    params['cookies']['__utma'] =\
-        '00000000.000#0000000.0000000000.0000000000.0000000000.0'
-
     return params
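For reference, the URL construction that remains in `request` after this hunk can be tried on its own. The sketch below is written for Python 3, where `urlencode` lives in `urllib.parse` (the diff targets Python 2 and imports it from `urllib`). The `base_url` value is one of the two shown in the diff, and the sample query and page number are made up for illustration.

```python
from urllib.parse import urlencode  # Python 3 location of urlencode

# Values taken from the diff; either base_url variant works the same way.
base_url = 'https://vimeo.com'
search_url = base_url + '/search/page:{pageno}?{query}'


def request(query, params):
    """Fill params['url'] from the page number and the URL-encoded query."""
    params['url'] = search_url.format(pageno=params['pageno'],
                                      query=urlencode({'q': query}))
    return params


# Hypothetical sample input, not taken from the source.
print(request('cat videos', {'pageno': 2})['url'])
```

`urlencode` takes care of quoting, so spaces and reserved characters in the query do not need to be handled by hand.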
@@ -52,15 +47,12 @@ def response(resp):

     dom = html.fromstring(resp.text)

-    p = HTMLParser()
-
     # parse results
     for result in dom.xpath(results_xpath):
         url = base_url + result.xpath(url_xpath)[0]
-        title = p.unescape(extract_text(result.xpath(title_xpath)))
-        thumbnail = extract_text(result.xpath(content_xpath)[0])
-        publishedDate = parser.parse(extract_text(result.xpath(publishedDate_xpath)[0]))
+        title = escape(html.tostring(result.xpath(title_xpath)[0], method='text', encoding='UTF-8').decode("utf-8"))
+        thumbnail = result.xpath(content_xpath)[0]
+        publishedDate = parser.parse(result.xpath(publishedDate_xpath)[0])

         # append result
         results.append({'url': url,
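The two title-handling variants in this hunk rely on Python 2-era helpers: `cgi.escape` and `HTMLParser().unescape`. Both have since been removed from the standard library, and the Python 3 equivalents are `html.escape` and `html.unescape`. The sketch below shows the difference between the two operations. The helper names and sample titles are hypothetical, and the standard-library `html` module is imported under an alias because the engine already uses `html` for `lxml.html`.

```python
import html as std_html  # aliased: the engine uses `html` for lxml.html


def escape_title(text):
    """Escape &, < and > for safe HTML output (quotes are left untouched)."""
    # quote=False mirrors the old cgi.escape default.
    return std_html.escape(text, quote=False)


def unescape_title(text):
    """Turn HTML entities such as &amp; back into plain characters."""
    return std_html.unescape(text)


# Hypothetical sample titles, not taken from the source.
print(escape_title('Tom & Jerry <3'))
print(unescape_title('Tom &amp; Jerry'))
```

Which direction is appropriate depends on whether the template layer escapes titles again on output. The diff itself does not say, so that choice is left open here.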