provided by FIZ Karlsruhe



Can I use STN AnaVist, Version 1.0, 1.01, or 1.1?

No.  On May 31, 2008, STN AnaVist, Version 1, was discontinued.  To download the latest software, visit the STN Software License and Download web site.  Alternatively, contact CAS Customer Care for a free copy on CD-ROM.

Can I re-visualize Version 1 document sets in Version 2.0?

Yes.  Version 1 document set files (.rkx) created in STN Express, Version 8.0 or higher, can be re-visualized in STN AnaVist, Version 2.0.


Step 1: Locate saved .rkx files.

For STN Express, Versions 8.0 to 8.2, the default location for saved .rkx files is:

  • C:\stnexp\Trnscrpt

For STN Express, Version 8.3, the default locations are:

  • Windows XP: C:\Documents and Settings\[user]\My Documents\STN Express 8.3\Trnscrpt
  • Windows Vista: C:\users\[user]\Documents\STN Express 8.3\Trnscrpt

Step 2: Convert .rkx files to .xta file format.

  1. In STN Express, Versions 8.2 and higher, select File > Convert STN AnaVist .rkx file.
  2. Locate your .rkx file to convert (see Step 1), and click Open.
  3. When the Select STN AnaVist File window appears, click Save.  A message will indicate when the .xta file has been saved.

Step 3: Re-visualize .xta files in Version 2.0.

  1. In STN AnaVist, Version 2.0, click Import.
  2. Locate the .xta file from Step 2, and click Open.
  3. Click Visualize.
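
Step 1 can also be scripted. The following is a minimal sketch that lists saved .rkx files in the default folders named above. The function names and the `user` parameter are hypothetical, and the script only finds files; the conversion itself still has to be done in STN Express.

```python
from pathlib import Path


def default_rkx_dirs(user):
    """Default transcript folders from Step 1 (user = Windows account name)."""
    return [
        r"C:\stnexp\Trnscrpt",  # STN Express 8.0 to 8.2
        rf"C:\Documents and Settings\{user}\My Documents\STN Express 8.3\Trnscrpt",  # 8.3, Windows XP
        rf"C:\users\{user}\Documents\STN Express 8.3\Trnscrpt",  # 8.3, Windows Vista
    ]


def find_rkx_files(directories):
    """Return the .rkx files found in the given folders, skipping missing folders."""
    found = []
    for directory in directories:
        folder = Path(directory)
        if not folder.is_dir():
            continue
        found.extend(
            p for p in folder.iterdir()
            if p.is_file() and p.suffix.lower() == ".rkx"
        )
    return sorted(found)
```

For example, `find_rkx_files(default_rkx_dirs("jsmith"))` returns an empty list if none of the default folders exist on the machine.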

What software can I use to create answer sets for STN AnaVist?

All document sets must be created with the Save for STN AnaVist wizard available in STN Express, Version 8.2 or higher, or with the STN AnaVist Assistant in STN on the Web.

Which databases and clusters are available for document set creation in STN AnaVist?

The following databases/clusters are available for answer set creation:

CAplus family

Is highlighting retained when I import answers into STN AnaVist that have been saved in STN Express?

No.  Highlighting is not retained.  Hit term highlighting is not available in STN AnaVist.

What data fields can I use for custom visualization?

The following data fields may be used for clustering:  Titles/Abstracts, Claims (Exemplary/First Claim and All Claims), Technology Indicators, and International Patent Classification (IPC) Codes.  Visualizations may also be created using combinations of text fields.  Research Landscapes created with IPC codes include IPC codes only.

Will I be charged another visualization fee if I visualize again based on a different set of data fields?


Will I be charged more for visualization if my document set includes records from multiple databases?

No.  The price to create a visualization project is determined by the total number of records included in the project, regardless of which database or databases are included.

How long does it take to visualize a results set?

The time needed to complete a visualization depends on several factors, including the number of documents to be visualized and the load on the system when the visualization starts.  The larger the document set, the longer the visualization takes.

When I convert an .rkx file, is the resulting .xta file updated with any new documents or indexing?

No.  The .rkx file is static and includes only the Accession Numbers (AN) of the answers saved on the day the file was created.

When working with the visualization results, are there any activities that may take longer than expected to complete?

In general, visualization activities happen fairly quickly.  Some requests, however, may take longer to complete:


  • Displaying an answer with a very large number of CAplus index terms
  • Displaying a very long patent document
  • Opening the list of cluster concepts for term grouping if the result set contains a large number of documents, i.e., several thousand answers
  • Displaying all entries for a chart or matrix when the total number of entries is in the thousands

Can I create a document set for Version 2.0 in CASREACT, MARPAT, the CA family of databases, and/or the CASLINK cluster?

Yes.  Conduct a search in one or more of these databases or the CASLINK cluster and then search the L-number in CAplus.  Use the Save for STN AnaVist Wizard with the resulting L-number created in CAplus.  For example:

=> file casreact

=> s vinyl chloride
         22325 VINYL
       115824 CHLORIDE
L1         482 VINYL CHLORIDE

=> file caplus

=> s L1
L2         482 L1

Why do I get an error message in STN Express, Version 8.2, when I use the Save for STN AnaVist Wizard with more than 20,000 records?

Previous versions of STN AnaVist included a filtering feature capable of reducing the number of imported documents to 20,000 or fewer.  Version 2.0 does not include this feature.  Document sets must be reduced to 20,000 documents or fewer in STN Express prior to using the Save for STN AnaVist Wizard.

What information is included in documents displayed in STN AnaVist?

Documents are displayed in a condensed format that provides bibliographic data and the abstract, if available.  The format also includes the data available to create bar and matrix charts.  A minimum of three additional display options per database are available in STN Express via the Display from STN AnaVist wizard.

What file formats are available when I save documents?

Documents can be saved in .rtf or .pdf formats.

Why is USPAT2 indicated as a source database for U.S. patent documents in STN AnaVist, Version 2.0, when in previous versions, only USPATFULL was indicated as the source?

Previous versions of STN AnaVist did not indicate USPAT2 as a source database because USPATFULL and USPAT2 were combined into a single database.

Version 1

Version 2.0

Combined as a single database



Source identification




First, intermediate, and latest publication stages

First and latest stages only

When using the Display from STN AnaVist Wizard in STN Express, select USPATFULL and USPAT2 for more comprehensive results.

Can I save full-text patent documents from STN AnaVist to evaluate in STN Viewer?

Yes.  When saving a document set, select the .xta file format.  A fully functional L-number can be created with the Create L-number from STN AnaVist wizard in STN Express.  The L-number can be used to:


  • Display answers from STN AnaVist in any format
  • Move full-text patent documents from STN AnaVist to STN Viewer via the Evaluate with STN Viewer wizard

Did CAS develop the clustering algorithm that creates the Research Landscape in STN AnaVist?

CAS licensed visualization software developed by Sandia National Laboratories as the base software for generating the Research Landscape.  The software was significantly enhanced to improve visualization results, utilizing the expertise of our database building staff and scientists:

  • CAS vocabulary to standardize the clustering concepts
  • A stopword list to improve cluster results for sci-tech searches

These enhancements allow the software to produce more scientifically relevant clusters that are focused on scientific and intellectual property information.

STN AnaVist uses force-directed placement for the text clustering.  Force-directed placement is a fast clustering method, most notably with larger answer sets.  To learn more about the clustering method used in STN AnaVist, refer to:
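
For readers unfamiliar with the general technique, the sketch below shows a basic force-directed placement in the style of the well-known Fruchterman-Reingold layout. It is an illustration only and is not the Sandia or CAS software: the function name, the parameters, and the idea of treating documents as nodes joined by "similarity" links are all assumptions made for this example.

```python
import math


def force_directed_layout(n_nodes, edges, iterations=200):
    """Return 2-D positions after a basic force-directed placement.

    Every pair of nodes repels; nodes joined by an edge also attract.
    """
    # Deterministic start: nodes evenly spaced on a unit circle.
    pos = [[math.cos(2 * math.pi * i / n_nodes),
            math.sin(2 * math.pi * i / n_nodes)] for i in range(n_nodes)]
    k = 1.0 / math.sqrt(n_nodes)  # preferred distance between linked nodes
    step_limit = 0.1              # largest move allowed per iteration

    for _ in range(iterations):
        disp = [[0.0, 0.0] for _ in range(n_nodes)]

        # Repulsion between every pair of nodes (strength k^2 / distance).
        for i in range(n_nodes):
            for j in range(i + 1, n_nodes):
                dx = pos[i][0] - pos[j][0]
                dy = pos[i][1] - pos[j][1]
                dist = math.hypot(dx, dy) or 1e-9
                force = k * k / dist
                disp[i][0] += dx / dist * force
                disp[i][1] += dy / dist * force
                disp[j][0] -= dx / dist * force
                disp[j][1] -= dy / dist * force

        # Attraction along each edge (strength distance^2 / k).
        for i, j in edges:
            dx = pos[i][0] - pos[j][0]
            dy = pos[i][1] - pos[j][1]
            dist = math.hypot(dx, dy) or 1e-9
            force = dist * dist / k
            disp[i][0] -= dx / dist * force
            disp[i][1] -= dy / dist * force
            disp[j][0] += dx / dist * force
            disp[j][1] += dy / dist * force

        # Move each node along its net force, capped by the step limit.
        for i in range(n_nodes):
            length = math.hypot(disp[i][0], disp[i][1]) or 1e-9
            move = min(length, step_limit)
            pos[i][0] += disp[i][0] / length * move
            pos[i][1] += disp[i][1] / length * move

        step_limit *= 0.98  # cool down so the layout settles

    return pos


# Hypothetical input: two groups of three mutually similar documents.
links = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5)]
layout = force_directed_layout(6, links)
```

In this toy input, linked documents settle close together while the two unlinked groups push apart, which is the behavior that produces separate peaks in a landscape-style view.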

Can I adjust the Concept Frequency when creating a Research Landscape based on IPC codes?

No.  Concept Frequency is always set to 100% and cannot be altered.

When selecting data fields for clustering, what is a Backup field?

If a document does not include data in the selected clustering field, the Backup field is used for clustering instead.  For example, including a backup field is important when clustering on Claims alone in a document set that includes patent and nonpatent documents.

Why do clustering options sometimes change?

Available clustering options are based on the source databases of the imported documents.  For example, the option to cluster using Technology Indicators is available only when the imported documents come from CAplus, USPATFULL, and USPAT2.  Technology Indicators are CAS indexing terms that are only available in documents from CAplus, and in chemistry-relevant USPATFULL and USPAT2 documents that have been enhanced with this indexing.

Does a custom visualization with Technology Indicators include indexing terms from WPINDEX?

No.  Technology Indicators are CAS indexing terms that are only available in documents from CAplus, and in chemistry-relevant USPATFULL and USPAT2 documents that have been enhanced with this indexing.

Why do certain clustering concepts in the Research Landscape seem to be mistakes or misnomers?

Use the Term Editor to change the name of any clustering concept. You can also report suspected anomalies by using the Contact Us button within STN AnaVist.  The list of clustering concept terms is continually updated.

Why don't researcher and company names in the bibliographic data always correspond to the charts?

Researcher names are grouped together under a standard representation whenever possible prior to the creation of charts with Key Researcher data.

Prior to creating charts with Key Organization/Assignee data, company names are standardized and grouped using an improved version of the CAS Company Name Thesaurus. 

The names appearing in the chart are the standardized names to which a number of variations may be mapped.

Can I use my own customized dictionary to standardize concepts?

Not at this time.

Why are documents displayed from STN AnaVist in STN Express not reflected in the STN AnaVist session summary?

Documents displayed using the Display from STN AnaVist Wizard are reflected in the STN Express expenditures and do not appear in the STN AnaVist session summary.

If I display a document in STN AnaVist, will I be charged to display it again if I use the Display from STN AnaVist Wizard?

Yes.  STN Express does not account for previous displays in STN AnaVist.

As a subscriber to WPIDS/WPIX, can I display WPIDS/WPIX documents in Version 2.0?

No.  The source database for displayed DWPI℠ documents is WPINDEX.  To display documents in WPIDS or WPIX formats, use the Display from STN AnaVist Wizard in STN Express.

Can I display both invention- and member-level documents from WPINDEX in Version 2.0?

No.  To display member-level documents, use the Display from STN AnaVist Wizard in STN Express.

What are the Technology Indicator terms in CAplus documents?

The terms used to create the Technology Indicators chart are from controlled indexing in CAplus documents.  The terms are harmonized to the current index terms where possible, to avoid data scattering.

Why is there a distant peak in the corner of my Research Landscape?

STN AnaVist creates Research Landscapes based on data available in the user-defined or default clustering fields.  Documents that do not have data in any of the clustering fields (including the Backup field) are identified as different from the other documents and are placed in a peak away from the others.  This peak may also include documents with clustering field data that is truly unique compared to the other documents.

Can I identify cells within a matrix chart that are applicable to more than one highlighting group?

Yes.  With the multiple-color highlighting feature in Version 2.0, certain cells within a matrix chart may be associated with more than one highlighting group.  Such cells appear with black triangles in each corner of the cell.  Hovering over the cell provides additional information.

Why do I get a message that there was not enough system memory to create a matrix chart?

Certain matrix charts may require a significant amount of system memory.  Try one or more of these solutions to avoid the system memory error:

  • Reduce the number of highlighting colors used at one time.
  • Reduce the number of rows and columns in the matrix by modifying Chart Properties.  To do this, right-click the chart and select Properties.  Under the View tab, reduce the number of rows and columns to include in the matrix chart.  Default matrix chart settings may be modified in the Charts menu available in Tools > Preferences.  (A default setting of 50 cells on each axis is optimal for performance.)
  • Create two bar charts for a similar result.  A matrix chart displays the number of documents where terms co-occur.  Co-occurring terms can also be displayed using bar charts with the highlighting feature.  When a term is highlighted in one bar chart, co-occurring terms are highlighted in the other bar chart.  By selecting Sort Descending by Highlighted Count, the co-occurring terms will automatically sort to the top each time you highlight another term.  You may set a higher number of bars on a bar chart than rows or columns on a matrix chart without system memory issues.
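
The co-occurrence counts that a matrix chart represents can be illustrated with a short sketch. The record layout and field names below are hypothetical and do not reflect the STN AnaVist data format; the point is only that each cell counts the documents in which a row term and a column term appear together.

```python
from collections import Counter
from itertools import product


def cooccurrence_counts(documents, row_field, col_field):
    """Count, for each (row term, column term) pair, the documents containing both."""
    counts = Counter()
    for doc in documents:
        rows = set(doc.get(row_field, []))  # de-duplicate terms within a document
        cols = set(doc.get(col_field, []))
        for pair in product(rows, cols):
            counts[pair] += 1
    return counts


# Hypothetical records with invented field names and values.
docs = [
    {"organization": ["Org A"], "ipc": ["C08F", "C08L"]},
    {"organization": ["Org A", "Org B"], "ipc": ["C08F"]},
    {"organization": ["Org B"], "ipc": ["B01J"]},
]
matrix = cooccurrence_counts(docs, "organization", "ipc")
```

A matrix limited to 50 rows and 50 columns holds at most 2,500 such cells, which is why reducing the row and column counts lowers the memory needed.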

Can I identify records within a bar that are applicable to more than one highlighting group?

Yes.  With the multiple-color highlighting feature in Version 2.0, any documents belonging to more than one highlighting group are grouped together in a single gray segment of the bar.  The documents represented by this segment are members of multiple highlighting groups, but not necessarily the same highlighting groups.

How can I print all documents in a visualization project?

To print all documents in a visualization project:

  1. Click and drag on the Research Landscape to highlight all documents.
  2. Click the Print icon and select the desired print options.

Only 1,000 documents can be printed at a time.

Are there any preference settings that should not be customized?

Yes.  To ensure good performance, keep the setting for new Matrix Charts at its default of 50 rows and columns.  To see more data in a matrix chart, right-click the chart and select Properties.  Under the View tab, modify the number of rows and columns to include in the matrix chart.

Can I print the entire set of documents after importing but before visualization?

No.  An entire results set can only be printed after it has been visualized. Prior to visualization, individual documents can be printed from the detailed view of that result.

Can I create multiple copies of a visualization project?

Yes.  Select File > Save Copy of <Project>.  The project will be saved as an .shx file.  When the saved project is opened, highlighting, additional charts, and edited terms are available just as in the original project.

Can I share projects across service centers with STN AnaVist?

It is not possible to share .shx files across service centers at this time.  However, .xta files may be shared between users with full-access STN login IDs.

Are visualization projects saved to large files?

No.  The .shx file created when a copy is saved is only a link to the visualization project maintained on the CAS server.

Can I edit the Detailed Report and the Summary Report?

Both the Detailed Report and the Summary Report are available as either .rtf or .pdf documents.  Both can be modified with an appropriate editor.

Can I create my own charts with the visualization data?

Yes.  Chart data can be saved in comma-delimited (.csv) format for use in Microsoft Excel.

What is the difference between the PNG and JPEG image formats available for saving an image of the landscape?

The PNG format is much more compact and provides a better image than the JPEG format.

Why does my list of Technology Indicators in STN AnaVist appear to be longer than the list saved in comma-delimited format (.csv) for Microsoft Excel?

There is a limit on the number of columns Microsoft Excel can handle. Imported data is truncated when that limit is reached.
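
One way to check whether a saved file is affected is to count its columns before opening it in Excel. The sketch below assumes the saved chart data is an ordinary comma-delimited text file in UTF-8 encoding; the function names are hypothetical. The limits used are Microsoft's published worksheet limits: 256 columns for the Excel 97-2003 format and 16,384 columns for Excel 2007 and later.

```python
import csv

XLS_MAX_COLUMNS = 256      # Excel 97-2003 worksheet limit
XLSX_MAX_COLUMNS = 16384   # Excel 2007 and later worksheet limit


def widest_row(csv_path):
    """Return the largest number of fields found on any row of the file."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return max((len(row) for row in csv.reader(handle)), default=0)


def exceeds_excel_limit(csv_path, max_columns=XLS_MAX_COLUMNS):
    """True if at least one row has more fields than Excel can show as columns."""
    return widest_row(csv_path) > max_columns
```

If `exceeds_excel_limit` returns True for a file, the complete data is still present in the .csv file itself and can be read with a text editor or a script, even though Excel shows only part of it.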