Introduction: The Data Source Splitter feature makes it easier and more efficient for a company to analyze several domains over the same set of log files.
For example, it could be used if a corporation has several virtual domains all served by the same set of web servers and all logged into the same set of log files. With Data Source Splitter, you can split log files into smaller logs based on virtual domains (or some other meaningful dividing criteria), create profiles for each of those virtual domains, specify additional profiles settings appropriate to each virtual domain, and then run the profiles on the smaller virtual domain-specific log files.
Data Source Splitter profiles specify a Data Source Splitter definition instead of the log files directly and provide you full flexibility for using all profile features. This way, multiple profiles can point to the same Data Source Splitter definition file. Otherwise, you would need to configure a full standard profiles, defining the same set of log files each time, for each analysis.
Note: The only place where Data Source Splitter modifies the profile is in the log file paths. Data Source Splitter modifies the profiles logfilepath setting to match the location where it is splitting the profile's log files. The remainder of the profile settings are the same as a standard profile.
How does the Data Source Splitter work? To use the Data Source Splitter, you create a Data Source Splitter profile, instead of a standard Webtrends Reporting Center profile, and point it to a Data Source Splitter definition. Note: Parent-Child profiles also analyze several domains over the same set of log files, but the profiles for each domain are automatically created.
Create a Data Source Splitter definition.
Create a profile which specifies the Data Source Splitter definition.
Analyze the profile to split the log files.
To create a Data Source Splitter definition, follow these steps. 1. From the links at the top of the Admin Console, select Data Source Splitter. The Data Source Splitter Definitions dialog box opens. 2. Click New. The Add New Data Source Splitter Definition dialog box opens to the Options tab. 3. In the Description text box, type a name for your Data Source Splitter definition. This name will appear in the Profiles list of available Data Source Splitter definitions. 4. From the Split Field drop-down list, select either Domain or URL to specify which field will be used in splitting the log file. 5. In the Split Tolerance text box, type the number of minutes that separate the log file splits. The default value is 120. To save processing time, you may want to increase this so that the log files are not split as often. 6. In the Split Log Directory text box, type the path to the directory where the split log files will be stored. The default setting is:
7. Select the Log Data Sources tab. 8. Click Log Data Sources. 9. From the Select Log Data Sources list, select the check boxes to the left of the log data sources that you want to include with this Data Source Splitter definition. 10. If necessary, add or edit existing log data sources as needed for this definition. 11. Click OK, then OK again to confirm saving your changes. 12. Select the Virtual Domains tab. 13. Select a virtual domain for this definition to use in its processing. You must select at least one. Steps for adding a virtual domain are provided in the next section. 14. Click Save. The new Data Source Splitter definition appears in the Data Source Splitter Definitions dialog box.
To add a virtual domain, follow these steps. 1. From the links at the top of the AdminConsole, select Data Source Splitter. 2. Either select a definition and click Edit, or click New. 3. Select the Virtual Domains tab. 4. Click New. 5. In the Description text box, type a description of this virtual domain. This description shows in the Data Source Splitter Definitions dialog box. 6. In the Pattern text box, type the pattern to use as split criteria. If you wish to use regular expressions for your pattern entry, select the Regular Expression check box. 7. If you are using a regular expression in defining your pattern, click Test to confirm your data. 8. When you have completed testing your settings, click OK. Your new virtual domain information appears in the Data Source Splitter Definitions dialog box.
To add a new Data Source Splitter profile, follow these steps. Create new Data Source Splitter profiles with the Add Profile Wizard. 1. From the Profiles menu on the left of the Admin Console, click New. The Add New Profile wizard opens. Select the type of content to be analyzed.
Web - Use this for standard web content.
Streaming Media - Use this for analyzing streaming media server log files.
2. For Choose The Type of Web Server Profile, select Data Source Splitter Profile (DSS). Note: A Data Source Splitter definition must be created before creating a Data Source Splitter profile for this option to show. If no Data Source Splitter definition has been defined, this option does not appear. 3. Click Next. You will see the General dialog box. 4. In the Name text box, type the name for this profile. This name also serves as a default profiles title in reports. 5. Specify how much of the log to analyze. To analyze the entire log file, select From Start of Log File(s). To analyze the log beginning with a date specified, select From The Following Date. Type the date. 6. If you need to adjust for time zone differences, in the Time Zone area, select the Override The GMT Offset To Use For Processing Reports check box. In the text box that appears, enter the value, in hours, that you want Webtrends Reporting Center to use. 7. In the Profile File Name text box, type the name for this profile. This name is listed in the Profile Statistics section below the list of profiles on the AdminConsole. 8. If you want to archive this profile, select Enable Archiving. 9. If you do not want this profile to be included in analysis, select Disable This Profile. 10. To capture web page titles for inclusion in reports, select Retrieve HTML Page Titles. 11. Click Next. 12. The Data Source Splitter dialog opens. 13. Select the Data Source Splitter definition and virtual domain that refers to the log file for this profile. 14. Click Next. 15. Finish creating your new Data Source Splitter profile.
Use the following settings to configure Data Source Splitter through the command line. [profile] isecm =  ecmfile = 7 ecmdefinition = 9qfG7pB0z15
isecm - tells whether or not there is a Data Source Splitter file. A value of 0 means there is not a file while a value of 1 means there is.
ecmfile - identifies the Data Source Splitter file. The default directory where the file is stored is:
ecmdefinition - a GUID matching one of the ID fields within an [ecmprofilex] section of the Data Source Splitter definition file. These profile settings identify the Data Source Splitter file, which includes general settings such as the log files to be split and the split tolerance. The profile also identifies the ecmprofile within the Data Source Splitter file, which contains the settings that indicate which set of split log files will be analyzed by this profile.