Beruflich Dokumente
Kultur Dokumente
Applies to:
SAP BusinessObjects Global Trade Services 8.0 For more information, visit the Governance, Risk, and Compliance homepage
Summary
Search and Classification TREX provides SAP applications with numerous services for searching, classification, and text-mining in large collections of documents (unstructured data) as well as for searching in and aggregating across business objects (structured data). This document shows how to administer TREX. Author: Company: Prarit Sehgal Regional Implementation group Governance, Risk, and Compliance SAP BusinessObjects Division 16 September 2009
Created on:
Version 1
Typographic Conventions
Type Style Example Text Description Words or characters quoted from the screen. These include field names, screen titles, pushbuttons labels, menu names, menu paths, and menu options. Cross-references to other documentation Example text Emphasized words or phrases in body text, graphic titles, and table titles File and directory names and their paths, messages, names of variables and parameters, source text, and names of installation, upgrade and database tools. User entry texts. These are words or characters that you enter in the system exactly as they appear in the documentation. Variable user entry. Angle brackets indicate that you replace these words and characters with appropriate entries to make entries in the system. Keys on the keyboard, for example, F2 or ENTER.
Icons
Icon Description Caution Note or Important Example Recommendation or Tip
Example text
Example text
<Example text>
EXAMPLE TEXT
Table of Contents
1. 2. 3. 4. Business Scenario............................................................................................................... 1 Background Information ..................................................................................................... 1 Prerequisites ........................................................................................................................ 2 Step-by-Step Procedure ...................................................................................................... 3 4.1 4.2 4.3 Starting the TREX admin tool ....................................................................................... 3 Connecting TREX with an Application .......................................................................... 3 4.2.1 4.3.1 4.3.2 4.3.3 4.4 5. Connecting TREX with an ABAP Application (RFC Connection) .................... 4 Starting and Stopping TREX .......................................................................... 11 TREX Admin Tool .......................................................................................... 11 Data backup and Restore for TREX .............................................................. 16 TREX Administration .................................................................................................. 11
FAQ ..................................................................................................................................... 18 5.1 5.2 Stopping TREX with the TREX Admin Tool ............................................................... 18 Selecting Languages for TREX Language Recognition ............................................. 19
6. 7.
1.
Business Scenario
Search and Classification TREX provides SAP applications with numerous services for searching, classification, and text-mining in large collections of documents (unstructured data) as well as for searching in and aggregating across business objects (structured data). TREX processes documents in a wide variety of formats and languages. TREX administration is organized into the following areas: Starting and Stopping TREX To start and stop TREX, you use the SAP Management Console and the executable files startsap.exe and stopsap.exe (Windows) or the Shell scripts startsap and stopsap (UNIX). TREX Admin Tools TREX provides the following administration tools: TREX Monitor in the portal You use this tool if you are implementing the Knowledge Management functions in the portal. TREX Admin Tool in the SAP System You use this tool if TREX is connected to an SAP system. TREX Admin Tool (Stand-Alone) You can use this tool independent of a portal or an SAP system. Data Backup and Restore for TREX You can make a backup of a TREX installation, and restore the TREX installation using the backup version if problems occur. Administration of the Search Engine Service The Search Engine Service (SES) enables users to search for business objects using TREX. As an administrator, you complete the initial configuration of the search engine service and monitor it during routine operation.
2. Background Information
TREX is based on client/server architecture. The client component is integrated into the application that uses the TREX functions, and allows communication with the TREX servers. The server component processes the requests; it indexes and classifies documents and answers search queries. The client component is subdivided into the Java client and ABAP client. The server component is subdivided into the following servers: Web server with TREX extension RFC server Queue server Preprocessor Index server Name server
3. Prerequisites
The TREX in installed, the user has access to TREX admin tool. You must create an SAP user that the TREX admin tool (stand-alone) can use to log on to the SAP system. In addition, the SAP user is required so that the TREX alert server has permission to regularly test and check the RFC configuration. The TREX admin tool (stand-alone) is used to configure and monitor TREX. You also use this admin tool to configure the RFC connection between TREX and the ABAP application that is using TREX. To use the TREX admin tool (stand-alone) to create the RFC destination, the admin tool requires an SAP system user that you create based on the predefined role SAP_BC_TREX_ADMIN. This user then has the authorization required to configure the RFC connection.
4. Step-by-Step Procedure
4.1 Starting the TREX admin tool
Log on with the user <sapsid>adm, and carry out one of the following steps Operating system: UNIX Procedure: Enter the following: cd <TREX_DIR> ./TREXAdmin.sh Operating system: Windows Procedure: Start the TREX admin tool by double-clicking <TREX_DIR>\TREXAdmin.bat in Windows Explorer
Determining the SAP System Connection Information The TREX admin tool (stand-alone) can connect to an SAP system in two ways. Through a specific application server of the SAP system (variant A) Through the message server of the SAP system (variant B) This variant uses the load-balancing function for the SAP system. The message server assigns the request from the TREX admin tool to any application server. Depending on the variant used, the TREX admin tool requires different connection information for the SAP system. You must determine the connection information and specify it later in the TREX admin tool. NOTE: SAP recommends using variant B. Variant A has the disadvantage that the connection does not work if the application server is not available. Procedure 1. Open the SAP Logon. SAP Logon is the program that you use to log on to an SAP system.
SAP system ID (SID) System number Application server host name Through the message server (variant B) SAP system ID (SID) Logon group, such as PUBLIC Message server host name
Configuring the RFC Connection in the TREX Admin Tool You work through the steps below using the TREX admin tool Creating a Connection 1. On the RFC: Current tab page in the Landscape: Connectivity window, choose the Create Connection function.
2. Choose connection type A or B. Specify the connection data for the SAP system.
3. Specify the SAP system user, the associated password, and the client that the TREX admin tool is to use to log on
If the SAP system user in question exists in the default client, you do not need to specify the client. Now click on RFC Destination Add details, SAP System, RFC Destination, and click save
After performing the above steps RFC connection is created in SM59. Go to transaction SM59
Click drop down TCP/IP connection Double click the RFC connection created TREX_11
You use the following executable files and scripts to start and stop TREX on windows and UNIX: Windows TREX admin tool (standalone) The startsap.exe and stopsap.exe executable files SAP Management Console As of TREX 7.10, the SAP Management Console is a snap-in in the Microsoft Management Console (MMC) and is no longer part of the TREX installation. To start TREX with the SAP Management Console, you must install the console in your TREX system first. UNIX TREX admin tool (standalone) The startsap and stopsap shell scripts
4.3.2
You can use this tool independently of a portal or an SAP system. The TREX admin tool provides all functions for configuring and monitoring TREX. You can use this tool to configure a distributed TREX system, to configure the RFC connections, to administrate delta indexes, and to trigger performance traces. In addition, this tool provides detailed technical information about the components of a TREX system. Using periodically executed checks, it can monitor the system status automatically. In the event of errors, you are informed by e-mail so that you can take appropriate action
Displays information on the processing of documents in a queue. See: Information on Processing Documents For displaying and Changing Queue Parameters See: Queue Parameters For deleting a document from the queue if errors occur. See: Deleting Documents from a Queue For stopping queues and reactivating them, and triggering the complete processing of a queue. See: Suspending a Queue Activating a Queue Triggering the Processing of a Queue For resetting the error status of documents. This affects documents that have not been preprocessed, indexed, or optimized. See: Resetting the Error Status For displaying Index Parameters See: Displaying Index Parameters For emptying the search and classification cache. See: Emptying the Search and Classification Cache
In the Connection to RFC Server field, use the RFC destination to choose which TREX system you want to administrate. Once you have selected the RFC destination, the administration areas become active.
The following administration areas are available in the TREX admin tool in the SAP system (transaction TREXADMIN): Tab Queue Administration This tab page is only relevant if indexing takes place using the queue server. Queue Administration You can monitor queues, configure queue parameters, and troubleshoot here. More information: Queue Administration Tab Page. Queue Landscape This tab page is relevant only for distributed TREX systems and only if indexing is done using a queue server. Description
Here you can see the distribution of the queues among the different queue servers. More information: Queue Landscape Tab Page. Index Administration This tab page displays technical information on indexes. Index Admin For example, here you can see the time of the last indexing, the number of searchable documents, or the disk space taken up by the index. This tab page displays the hierarchical structure of an index. Index Hierarchy It shoes the index type of an index. In the case of a split index, you can see which index parts it consists of. More information: Index Types. Join Index Info This tab page displays technical information about compound indexes and BIA indexes. This tab page is only relevant for distributed TREX systems. Index Landscape Here you can see the distribution of the indexes among the different index servers. In addition, you can check the index replication status. For more information, see Index Distribution and Index Replication Status. Usage Search This area displays information about index use and statistics. You can trigger searches from the Search tab page. You use this function for test purposes only. You can use this TREX function to display the statistical distribution of attributes in an index.
You use this tab page to administrate and configure the TREX cruiser components. The TREX cruiser enables a file search on file servers and Web servers. TREX then indexes the files that are extracted from the searched file and Web servers. Cruiser More information: Cruiser Administration. Note For more information on using and optimizing the TREX cruiser, see
SAP Note 1110112 TREX 7.0/7.1: How to use the TREX Cruiser.
Landscape Administration This tab page provides an overview of the TREX status with information on the TREX version, operating system, and current error messages. This tab page is used to monitor TREX. Checks are performed periodically and in the event of errors, you can be notified by e-mail. For more information, see TREX Alert Server. This tab page enables the reorganization of the indexes in a TREX landscape. More information: Reorganization of the TREX Landscape. This tab page displays technical information on a TREX system. Topology For more information, see Topology Information. Displays the TREX configuration files. You can display the content of the configuration files and change it if necessary. Caution Changing configuration files directly requires expert knowledge of the TREX architecture and TREX-internal processes. Only make direct changes to the configuration files if those changes are described in the TREX documentation.
Summary
Alert
Reorg
Ini Files
Hosts
This tab page displays information on the installation directory, operating system platform, and installed TREX release. More information: Information on TREX Servers (TREXADMIN) Displays the load on the TREX servers in a table. There is a separate row for each TREX server. More information: Information on TREX Servers (TREXADMIN) On the Threads tab page, you can see which threads are currently executing which methods. You can also see how long processing of a method has taken so far. More information: Information on TREX Servers (TREXADMIN)
Services
Threads
Handles
On the Handles tab page, you can see the number of handles that a process has generated and is using. More information: Information on TREX Servers (TREXADMIN) Displays the trace files of the TREX servers.
Trace
By default, the trace files contain all error messages that the TREX servers output. You can display the contents of a trace file by double-clicking it. You can download the trace files. This tab page displays the configuration of a distributed TREX system. You can only display data here. You configure a distributed TREX system using the TREX admin tool (standalone).
Landscape
For more information about setting up and configuring a distributed TREX system, see Distributed TREX Systems (Multiple Host Installation) and Setting Up a Distributed System. This tab page shows whether or not all of the SAP system application servers have a connection to the TREX system. If there are errors here, they can have the following causes: The application server is communicating through an SAP gateway that has not been started. The parameters of the RFC destination do not match the TREX configuration. No TREX RFC server has registered with the SAP gateway. Note The window shows only whether the connection from within the SAP system works. It does not display whether the connection works from within the TREX system. The TREX admin tool (standalone) displays the complete status of the RFC connection and allows you to configure the RFC connection and correct the error. For more information on the RFC connection between TREX and the application using TREX, see Configuring and Administrating the RFC Connection.
RFC Monitoring
4.3.3
You can back up Search and Classification (TREX), and restore the TREX installation using the backup version if problems occur (backup and restore). You can back up and restore TREX in the following ways:
Data Backup (Online) and Restore You back up TREX indexes and queues online using Python scripts without stopping TREX and while the TREX search continues to be available. You then restore the saved data offline. This is the usual way to back up TREX data. It is a good idea to back up the TREX indexes if the original index creation process took a long time and you want to avoid having to reindex if the full-text information is lost. SAP also recommends that you back up your data if a large number of documents have been added to an index since the original indexing process.
Complete Data Backup and Restore (Offline) You back up a complete TREX installation (TREX indexes and configuration) offline and restore it again. You do this only in special circumstances, particularly to back up the settings in a TREX configuration. In this case, the complete data backup and restore of the TREX installation can take the place of a new installation.
b. Determine the port of the TREX name server by means of the following rule: <name_server_port>: 3<instance_number>01 On UNIX, navigate to following location or path shown below.
2. Use the user <j2eeadm> to log onto the host on which the Application Server Java is running.
3. Start the SAP NetWeaver Administrator and log on to the AS Java . 4. Navigate to Configuration Management Infrastructure Management Java System Properties. 5. In the new screen go to the tabulator Services in the screen area Details and type in trex.service to filter the available services. 6. In the Extended Details area under the tabulator Properties enter the address of the TREX name server into the parameter nameserver.address: tcpip://<host_name_of_trex_host>:<name_server_port> You enter only the host name or the host name and the domain depending on your network environment. tcpip://mytrexhost:34801 or tcpip://mytrexhost.mydomain:34801
The address of the TREX name server must be configured for all server processes of the cluster. Otherwise the connection between the AS Java and TREX cannot be established. 7. In case of a multiple host scenario for a distributed TREX landscape you enter the addresses of the TREX backup name servers in the parameter nameserver.backuplist. Specify the address of the TREX backup name servers, separated by comma in the format: tcpip://<host1>:<port1>,tcpip://<host1>:<port1>, For a TREX single-host installation you do not need to specify backup name servers. 8. Save your changes and confirm the restart of the service.
5. FAQ
5.1 Stopping TREX with the TREX Admin Tool
1. Log on with the user <sapsid>adm. 2. Start the TREX admin tool by double-clicking <TREX_DIR>\TREXAdmin.bat in Windows Explorer. 3. In the TREX admin tool, navigate to the Landscape Services area and then go to the MMC tab.
4. You can stop TREX by clicking the SAP System: Stop button or by selecting the TREX host, opening its context menu with the secondary mouse button, and selecting the relevant option. You can stop Selected Hosts or All Hosts for a landscape.
6.
Your feedback is very valuable and will enable us to improve our documents. Please take a few moments to complete our feedback form. Any information you submit will be kept confidential. You can access the feedback form at:
http://www.surveymonkey.com/s.aspx?sm=stdoYUlaABrbKUBpE95Y9g_3d_3d
7. Copyright
2009 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG. The information contained herein may be changed without prior notice. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. Microsoft, Windows, Outlook, and PowerPoint are registered trademarks of Microsoft Corporation. IBM, DB2, DB2 Universal Database, OS/2, Parallel Sysplex, MVS/ESA, AIX, S/390, AS/400, OS/390, OS/400, iSeries, pSeries, xSeries, zSeries, System i, System i5, System p, System p5, System x, System z, System z9, z/OS, AFP, Intelligent Miner, WebSphere, Netfinity, Tivoli, Informix, i5/OS, POWER, POWER5, POWER5+, OpenPower and PowerPC are trademarks or registered trademarks of IBM Corporation. Adobe, the Adobe logo, Acrobat, PostScript, and Reader are either trademarks or registered trademarks of Adobe Systems Incorporated in the United States and/or other countries. Oracle is a registered trademark of Oracle Corporation. UNIX, X/Open, OSF/1, and Motif are registered trademarks of the Open Group. Citrix, ICA, Program Neighborhood, MetaFrame, WinFrame, VideoFrame, and MultiWin are trademarks or registered trademarks of Citrix Systems, Inc. HTML, XML, XHTML and W3C are trademarks or registered trademarks of W3C, World Wide Web Consortium, Massachusetts Institute of Technology. Java is a registered trademark of Sun Microsystems, Inc. JavaScript is a registered trademark of Sun Microsystems, Inc., used under license for technology invented and implemented by Netscape. MaxDB is a trademark of MySQL AB, Sweden.
SAP, R/3, mySAP, mySAP.com, xApps, xApp, SAP NetWeaver, and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and in several other countries all over the world. All other product and service names mentioned are the trademarks of their respective companies. Data contained in this document serves informational purposes only. National product specifications may vary. These materials are subject to change without notice. These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. These materials are provided as is without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP shall not be liable for damages of any kind including without limitation direct, special, indirect, or consequential damages that may result from the use of these materials. SAP does not warrant the accuracy or completeness of the information, text, graphics, links or other items contained within these materials. SAP has no control over the information that you may access through the use of hot links contained in these materials and does not endorse your use of third party web pages nor provide any warranty whatsoever relating to third party web pages. Any software coding and/or code lines/strings (Code) included in this documentation are only examples and are not intended to be used in a productive system environment. The Code is only intended better explain and visualize the syntax and phrasing rules of certain coding. SAP does not warrant the correctness and completeness of the Code given herein, and SAP shall not be liable for errors or damages caused by the usage of the Code, except if such damages were caused by SAP intentionally or grossly negligent.
https://www.sdn.sap.com/irj/bpx/grc