Situation 1:
I have a list of company names and last names. I need to search through a list of company names RELATED to the last name.
For example, I need to search for the combination of "Company A" with "Last Name A" at the same time. Then the process needs to loop and search the combination of "Company B" with "Last Name B," then "Company C" with "Last Name C," etc., etc., etc.
This loop needs to continue through an entire list of COMPANY NAMES that are directly related to LAST NAMES. (In other words, I'm not looping through Last Names for one company and then looping through the Last Names again for the next company and so on--instead--each combination of COMPANY NAME and LAST NAME make ONE RECORD that I need to search for each time).
Situation 2:
I have to conduct a search by entering TWO separate terms in TWO separate text boxes on the search form.
The looping lists I have are each related. That means for instance, when I enter "A" in box one, I need to enter "1" in box two to complete the search. When I enter "B" in box one, I need to enter "2" in box two to complete the search. When I enter "C" in box one, I need to enter "3" in box two to complete the search. (Etc., etc., etc.,)
e.g.
A --> 1
B --> 2
C --> 3
D --> 4
E --> 5
Etc., etc., etc.
So I have two lists that need to "loop" in tandem with each other (List 1 > List 2).
A:
Currently it's hard for Octoparse to do this type of "in tandem" loop.
That is, Octoparse cannot deal with the “text list loop” when completing the search on a one-to-one basis.
Let’s take the “Situation 2” for example.
We have two lists for two separate text boxes on the search form.
The first list for text box one is A, B, C, D...
The second list for text box two is 1, 2, 3, 4...
We need to get the search results by entering A in text box one and 1 in text box two separately, or entering B and 2 separately, and etc. If you create two “Text list” loop to enter these two lists,
and Octoparse will execute an one-to-many match like this:
In this case, Octoparse will extract some extra data you don't want to get.