Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
中
中电中采
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
ZGC_INDEX
中电中采
Commits
d7763103
Commit
d7763103
authored
Dec 25, 2020
by
rico.liu
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
add deal crawl pic
parent
add0f18a
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
10 additions
and
2 deletions
+10
-2
main.py
模板建库/main.py
+10
-2
No files found.
模板建库/main.py
View file @
d7763103
...
@@ -392,6 +392,14 @@ def GetParamsinfoAndPic(df):
...
@@ -392,6 +392,14 @@ def GetParamsinfoAndPic(df):
response
=
requests
.
request
(
"POST"
,
request_url
,
data
=
payload
)
response
=
requests
.
request
(
"POST"
,
request_url
,
data
=
payload
)
res
=
eval
(
response
.
text
)
res
=
eval
(
response
.
text
)
#处理未爬取到的数据
for
element
in
res
:
if
element
:
pass
else
:
res
.
remove
(
element
)
res
.
append
({
'img_list'
:[],
'class_list'
:{},
'url'
:
''
})
df
[
'url_pic'
]
=
[
str
(
element
[
'img_list'
])
for
element
in
res
]
df
[
'url_pic'
]
=
[
str
(
element
[
'img_list'
])
for
element
in
res
]
crawl_params_list
=
[
str
(
element
[
'class_list'
])
.
replace
(
"'': ''"
,
""
)
.
replace
(
", ,"
,
","
)
.
replace
(
"{,"
,
"{"
)
.
replace
(
" "
,
""
)
for
element
in
res
]
crawl_params_list
=
[
str
(
element
[
'class_list'
])
.
replace
(
"'': ''"
,
""
)
.
replace
(
", ,"
,
","
)
.
replace
(
"{,"
,
"{"
)
.
replace
(
" "
,
""
)
for
element
in
res
]
url_params_list
=
[]
url_params_list
=
[]
...
@@ -1978,8 +1986,8 @@ path = '/Users/rico/project/模板建库v2/历史数据/20201202/路桥建库模
...
@@ -1978,8 +1986,8 @@ path = '/Users/rico/project/模板建库v2/历史数据/20201202/路桥建库模
#初始化数据
#初始化数据
InitializeData
(
path
)
InitializeData
(
path
)
#初始化参数
#初始化参数
channel_alias
=
'
CL
-MBJK'
channel_alias
=
'
TJX
-MBJK'
batch
=
'2020-12-
02
'
batch
=
'2020-12-
25
'
#加载数据
#加载数据
df
=
LoadData
(
batch
,
channel_alias
,
'deal'
)
df
=
LoadData
(
batch
,
channel_alias
,
'deal'
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment