SCI2S Software

Description

WoS Query Partitioner is a tool that interactively splits a Web of Science query which returns more than 100,000 results into smaller queries to allow to easily obtain an exact result count.

It does work by splitting the query using the Source field (SO) in a kind of Divide and Conquer recursive strategy.

In addition the application offers two different kinds of graphs which depict the partitioning process as well as a LaTeX table output of the executed queries.

The details of its inner workings will be posted here once the theoretical work in which it is based is published.

Application

Please refer to the instructions & troubleshooting section if you have any problems or questions about the application use.

Graphs and Table

Once the application has finished partitioning a query two different graphs will be shown in this section. The first one is a tree graph structure that depicts how the original query has been splitted. The second one is a partition graph in which the number of results of each subquery is represented in a proportional way.

In addition, and to ease the incorporation of results to other works, the application also generates a LaTeX table code with all the subqueries that have been generated along with their number of results and total results count.

Queries Tree

Partition Graph

LaTeX Queries Table

Instructions & Troubleshooting

The use of the WoS Query Partitioner is quite simple:

  1. Insert the WoS Query to split in the Query field and press the Begin Query Partitioning button. For example, you can input the query PY=2007 and CU=USA.

    Application Snapshot

  2. A new window will open asking to execute a particular query to the WoS interface and requesting to input the number of results obtained in the blank field. To help the copy and paste mechanism into the Web of Science web page, the query to execute is automatically inserted into the system clipboard. If the results count of the executed query is greater than 100,000 (>100000), the field must be left blank. The Next Iteration button should then be pressed.

    Application Snapshot

  3. Step 2 will be repeated several times until a final results count is obtained. Once this count is shown, the Graphs and LaTeX Table will be generated.

    Application Snapshot

Troubleshooting

The WoS Query Partitioner has been deployed in form of a Java Applet that is run directly in a Web Browser. Thus, it is necessary to have the latest Java Plugin in order to run it. It can be freely downloaded for a great variety of platforms (including GNU/Linux, MacOS and even Windows) from the Java Webpage. The application should work in almost any platform, but it has only been thoroughly tested in a GNU/Linux environment.

The generated graphs use the SVG standard for vectorial graphics. Thus, a compatible browser will be needed in order to view those images. At the time of he writing, all the major browsers support SVG with the exception of Internet Explorer. In the deployment of the WoS Query Partitioner Firefox has been used and tested to correctly show the generated SVG images.

Finally, it is important to mention that as the application copies some information to the system clipboard (in order to ease the copy & paste to the Web of Science interface) a security warning will be shown (see below).

Security Warning

You should accept it in order to make the application work. Please note that the application has been deployed taking the highest care to avoid any security problems and that it is freely available in the hope that it will be useful, but WITHOUT ANY WARRANTY.

If you have any additional question about the WoS Query Partitioner, please do not hesitate to contact S. Alonso.

Acknoledgements and Citation

Sample Results

In the following we show the results of the application of the first Divide & Conquer approach to obtain the results count for the query TS=cancer. Note that this results where obtained in October 2009 and will have probably changed since then. Summing up, a total of 32 queries had to be executed to obtain the final count of 908155.

Sample Results
  Query Items Sum
#1 TS=cancer >100000  
#2 TS=cancer AND (SO=J* OR SO=C* OR SO=I* OR SO=S* OR SO=E* OR SO=N* OR SO=F* OR SO=H* OR SO=O* OR SO=Z* OR SO=W* OR SO=U* OR SO=Y* OR SO=X* OR SO=3* OR SO=5* OR SO=7* OR SO=9*) >100000  
#3 TS=cancer AND (SO=JOURNAL * OR SO=I* OR SO=E* OR SO=F* OR SO=O* OR SO=W* OR SO=JA* OR SO=JU* OR SO=JE* OR SO=JI* OR SO=JOURNALS* OR SO=JN* OR SO=JOI* OR SO=3* OR SO=7* OR SO=JM* OR SO=JS* OR SO=JOG* OR SO=JOK*) >100000  
#4 TS=cancer AND (SO=JOURNAL O* OR SO=E* OR SO=O* OR SO=JA* OR SO=JOURNAL D* OR SO=JE* OR SO=JOURNALS* OR SO=JOI* OR SO=3* OR SO=JM* OR SO=JOG* OR SO=JOURNAL I*) >100000  
#5 TS=cancer AND (SO=E* OR SO=O* OR SO=JOURNAL OF A* OR SO=JOURNAL OF M* OR SO=JOURNAL OF S* OR SO=JOURNAL OF B* OR SO=JOURNAL OF F* OR SO=JA* OR SO=JOURNAL OF R* OR SO=JOURNAL OF L* OR SO=JOURNAL OF D* OR SO=JOURNAL D* OR SO=JOURNAL OF K* OR SO=JOURNAL OF Z* OR SO=JOURNAL OF Q* OR SO=JOURNALS* OR SO=JOURNAL OF X* OR SO=JM* OR SO=JOURNAL I*) >100000  
#6 TS=cancer AND (SO=E* OR SO=JOURNAL OF A* OR SO=JOURNAL OF S* OR SO=JOURNAL OF F* OR SO=JOURNAL OF R* OR SO=JOURNAL OF D* OR SO=JOURNAL OF K* OR SO=JOURNAL OF Q* OR SO=JOURNAL OF X* OR SO=JOURNAL I*) 69181 69181
#7 TS=cancer AND (SO=O* OR SO=JOURNAL OF M* OR SO=JOURNAL OF B* OR SO=JA* OR SO=JOURNAL OF L* OR SO=JOURNAL D* OR SO=JOURNAL OF Z* OR SO=JOURNALS* OR SO=JM*) NOT #6 53647 122828
#8 TS=cancer AND (SO=JOURNAL OF T* OR SO=JOURNAL OF C* OR SO=JOURNAL OF P* OR SO=JOURNAL OF E* OR SO=JOURNAL OF N* OR SO=JOURNAL OF I* OR SO=JOURNAL OF H* OR SO=JOURNAL OF G* OR SO=JOURNAL OF O* OR SO=JOURNAL OF V* OR SO=JOURNAL OF W* OR SO=JOURNAL OF J* OR SO=JOURNAL OF U* OR SO=JE* OR SO=JOURNAL OF Y* OR SO=JOI* OR SO=3* OR SO=JOG*) NOT #6 NOT #7 >100000  
#9 TS=cancer AND (SO=JOURNAL OF THE * OR SO=JOURNAL OF P* OR SO=JOURNAL OF N* OR SO=JOURNAL OF H* OR SO=JOURNAL OF O* OR SO=JOURNAL OF W* OR SO=JOURNAL OF J* OR SO=JOURNAL OF THER* OR SO=JOURNAL OF TA* OR SO=JE* OR SO=JOURNAL OF TO* OR SO=JOI* OR SO=JOURNAL OF TU* OR SO=3*) NOT #6 NOT #7 41732 164560
#10 TS=cancer AND (SO=JOURNAL OF C* OR SO=JOURNAL OF E* OR SO=JOURNAL OF I* OR SO=JOURNAL OF G* OR SO=JOURNAL OF V* OR SO=JOURNAL OF TR* OR SO=JOURNAL OF U* OR SO=JOURNAL OF TE* OR SO=JOURNAL OF THEO* OR SO=JOURNAL OF Y* OR SO=JOURNAL OF THO* OR SO=JOURNAL OF TI* OR SO=JOURNAL OF THR* OR SO=JOG*) NOT #6 NOT #7 NOT #9 64133 228693
#11 TS=cancer AND (SO=I* OR SO=F* OR SO=W* OR SO=JOURNAL F* OR SO=JU* OR SO=JI* OR SO=JN* OR SO=JOURNAL A* OR SO=7* OR SO=JS* OR SO=JOK* OR SO=JOURNAL N*) NOT #6 NOT #7 NOT #9 NOT #10 73448 302141
#12 TS=cancer AND (SO=C* OR SO=S* OR SO=N* OR SO=H* OR SO=Z* OR SO=U* OR SO=Y* OR SO=X* OR SO=JC* OR SO=JOURNALI* OR SO=JB* OR SO=JOA* OR SO=JOR* OR SO=5* OR SO=9* OR SO=JP* OR SO=JOE* OR SO=JOH* OR SO=JOM*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 >100000  
#13 TS=cancer AND (SO=S* OR SO=H* OR SO=CA* OR SO=Z* OR SO=CL* OR SO=CE* OR SO=CI* OR SO=CY* OR SO=CZ* OR SO=CM* OR SO=JOURNALI* OR SO=JB* OR SO=JOR* OR SO=5* OR SO=JP* OR SO=JOH* OR SO=CB* OR SO=CT*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 >100000  
#14 TS=cancer AND (SO=S* OR SO=CA* OR SO=CL* OR SO=CI* OR SO=CZ* OR SO=JOURNALI* OR SO=JOR* OR SO=JP* OR SO=CB*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 >100000  
#15 TS=cancer AND (SO=CA* OR SO=SC* OR SO=CL* OR SO=SU* OR SO=CI* OR SO=SI* OR SO=SH* OR SO=SL* OR SO=SB* OR SO=SW* OR SO=SV* OR SO=SN* OR SO=CB* OR SO=SD* OR SO=SJ* OR SO=SZ*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 >100000  
#16 TS=cancer AND (SO=CL* OR SO=SCI* OR SO=CI* OR SO=CAR* OR SO=SH* OR SO=SCH* OR SO=SL* OR SO=CAH* OR SO=SCR* OR SO=SW* OR SO=SV* OR SO=SN* OR SO=SD* OR SO=SZ* OR SO=CAB* OR SO=CAO* OR SO=CAV* OR SO=SCU*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 34401 336542
#17 TS=cancer AND (SO=CAN* OR SO=SU* OR SO=SI* OR SO=SCA* OR SO=CAT* OR SO=CAL* OR SO=SCO* OR SO=CAM* OR SO=SB* OR SO=CAD* OR SO=CAS* OR SO=CB* OR SO=SJ* OR SO=CA-* OR SO=CAI* OR SO=CAP* OR SO=SCE*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 85724 422266
#18 TS=cancer AND (SO=SO* OR SO=ST* OR SO=SE* OR SO=SP* OR SO=SA* OR SO=SY* OR SO=SM* OR SO=SK* OR SO=CZ* OR SO=JOURNALI* OR SO=JOR* OR SO=JP* OR SO=S * OR SO=SG* OR SO=SR*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 16526 438792
#19 TS=cancer AND (SO=H* OR SO=Z* OR SO=CE* OR SO=CY* OR SO=CM* OR SO=JB* OR SO=5* OR SO=JOH* OR SO=CT*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 28768 467560
#20 TS=cancer AND (SO=N* OR SO=CO* OR SO=CH* OR SO=CU* OR SO=U* OR SO=CR* OR SO=Y* OR SO=X* OR SO=JC* OR SO=CN* OR SO=CC* OR SO=JOA* OR SO=CF* OR SO=9* OR SO=JOE* OR SO=JOM* OR SO=CS* OR SO=CW*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 57764 525324
#21 TS=cancer AND (SO=A* OR SO=P* OR SO=B* OR SO=M* OR SO=R* OR SO=T* OR SO=G* OR SO=D* OR SO=L* OR SO=V* OR SO=K* OR SO=Q* OR SO=2* OR SO=1* OR SO=4* OR SO=6* OR SO=8* OR SO=0*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 >100000  
#22 TS=cancer AND (SO=A* OR SO=B* OR SO=R* OR SO=G* OR SO=L* OR SO=K* OR SO=2* OR SO=4* OR SO=8*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 >100000  
#23 TS=cancer AND (SO=B* OR SO=G* OR SO=AC* OR SO=AR* OR SO=K* OR SO=AU* OR SO=AP* OR SO=AG* OR SO=AT* OR SO=AB* OR SO=AV* OR SO=AA* OR SO=AK* OR SO=A * OR SO=8*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 >100000  
#24 TS=cancer AND (SO=G* OR SO=BU* OR SO=AR* OR SO=BR* OR SO=AU* OR SO=AP* OR SO=AG* OR SO=BM* OR SO=BL* OR SO=AA* OR SO=AK* OR SO=BS* OR SO=BF* OR SO=BT* OR SO=8*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 98550 623874
#25 TS=cancer AND (SO=AC* OR SO=BI* OR SO=K* OR SO=BO* OR SO=BE* OR SO=BA* OR SO=AT* OR SO=AB* OR SO=AV* OR SO=BY* OR SO=B * OR SO=A * OR SO=BJ* OR SO=BW* OR SO=B-*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 39606 663480
#26 TS=cancer AND (SO=R* OR SO=AN* OR SO=L* OR SO=AM* OR SO=AD* OR SO=AS* OR SO=AL* OR SO=AF* OR SO=AQ* OR SO=AI* OR SO=2* OR SO=AE* OR SO=AJ* OR SO=4* OR SO=AX*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 >100000  
#27 TS=cancer AND (SO=RE* OR SO=L* OR SO=AD* OR SO=AL* OR SO=RU* OR SO=RO* OR SO=AQ* OR SO=RH* OR SO=AE* OR SO=RN* OR SO=RY* OR SO=AX* OR SO=R\&* OR SO=RS*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 35432 698912
#28 TS=cancer AND (SO=AN* OR SO=AM* OR SO=AS* OR SO=RA* OR SO=AF* OR SO=RI* OR SO=AI* OR SO=2* OR SO=AJ* OR SO=RL* OR SO=4* OR SO=R * OR SO=RB*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 87603 786515
#29 TS=cancer AND (SO=P* OR SO=M* OR SO=T* OR SO=D* OR SO=V* OR SO=Q* OR SO=1* OR SO=6* OR SO=0*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 >100000  
#30 TS=cancer AND (SO=M* OR SO=D* OR SO=PH* OR SO=PA* OR SO=PE* OR SO=PL* OR SO=PU* OR SO=PF* OR SO=PT* OR SO=1* OR SO=0* OR SO=PC* OR SO=PP*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 70364 856879
#31 TS=cancer AND (SO=T* OR SO=PR* OR SO=V* OR SO=PO* OR SO=PS* OR SO=Q* OR SO=PI* OR SO=PM* OR SO=PY* OR SO=6* OR SO=P * OR SO=PN*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 NOT #30 51130 908009
#32 TS=cancer NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 NOT #30 NOT #31 146 908155