Main Settings (Remote Windows Content Crawler)

Note: You must install Oracle WebCenter Content Service for Windows Files to access this functionality.

To learn about content crawlers and content services, click here.

To specify the file configuration settings, destination folder, and security for documents imported by this content crawler:

Specify the number of folder levels into which this content crawler will crawl into your file system. For example, the maximum number of levels to crawl setting has the following effects:

If you select Unlimited, the content crawler will crawl all subfolders in the target folder.
If you select 0, the content crawler will crawl only the target folder.
If you select 1, the content crawler will crawl the target folder and its first level of subfolders.

In this way you can prevent your content crawler from importing unwanted content.

In the File path box, type the UNC path of the target folder, the folder where this content crawler should begin crawling. For example, \\myComputer\myFolder\mySubfolder.
Specify into which folders you want to import content. The content crawler attempts to import a link to every document it finds into the most subordinate subfolder within the destination folder that allows the link to pass. Click here for a flow chart showing how the content crawler determines into which folders it will import content.

To add destination folders, click Add Folder; then, in the Choose Folders dialog box, select the folders you want to add and click OK. To crawl documents into a folder, you must have at least Edit access to that folder.
To remove a folder, select the folder and click .
To select or clear all of the folder check boxes, select or clear the box to the left of Folder Path.
To toggle the order in which the folders are sorted, click Folder Path or click the icon to the right of that— (sort ascending, a-z) or (sort descending, z-a)..

If the content Web service used by this content crawler supports folder mirroring (specified on the Advanced Settings page of the Content Web Service Editor), you can have this content crawler create Knowledge Directory folders that duplicate the folder structure of the content repository being crawled by selecting Mirror the source folder structure.

Notes:

If you mirror the folder structure and import security information with each document (described in Step 7), the folder security is imported for the mirrored folders.
If you mirror the folder structure, upon successive runs the content crawler removes any portal folders that do not have corresponding source folders. For this reason, if you run this content crawler periodically, neither you nor anyone else should modify the mirrored portal folders or documents in any way.
You cannot change the mirror setting after creation of this content crawler. That is, if you set this content crawler to mirror the folder structure, you cannot edit this setting later.

To require that documents pass the filters of destination folders before the documents are imported into those folders, select Apply Filter of Destination folder. By default, documents do not need to pass the filters of destination folders, so all documents will be imported into all destination folders.
To accept all imported documents into the portal and make them immediately available to users, select Automatically approve imported documents. By default, documents require approval. This means that before the link to the imported document is available to users, it must be approved by a portal administrator with at least Edit access to the destination folder.

If you are mirroring the folder structure, you might want to set imported documents to be approved automatically and restrict users to Read access (users in the Administrators group always have Admin access). If you set imported documents to require approval, be aware that any portal administrator who has at least Edit access can also modify the folders and content, and can therefore make your portal folders and content out of sync with your source repository.
If the content Web service used by this content crawler supports security importation and the source repository users have been imported into the portal and mapped in the Global ACL Sync Map, you can have this content crawler import security information with each document by selecting Import security with each document. This automatically makes documents available to source repository users available to corresponding portal users.

Note: Because read access is equivalent in the source repository and the portal, but write access is not, only read access is imported; write access is ignored because write access to a document in an external repository allows you to edit the document, but write access (referred to as Edit access) in the portal allows you to edit the Properties and security settings of that document.
Under Document Access Privileges, you can perform the following actions to grant users and groups access to the content imported by this content crawler:

To add users or groups, click Add Users/Groups; then, in the Choose Groups and Users dialog box, select the groups and users you want to add and click OK. To select groups and users, you must have at least Select access to them.
For each user or group, in the associated Privilege drop-down list, choose the access privilege you want to grant for content imported by this content crawler.
To remove a user or group, select the user or group and click .
To select or clear all of the user and group check boxes, select or clear the box to the left of Users/Groups.
To toggle the order in which the users and groups are sorted, click Users/Groups or click the icon to the right of that— (sort ascending, a-z) or (sort descending, z-a).
To view the members of a group, click the group name.

To display the page associated with this help topic:

Click Administration.
Open the Remote Content Crawler Editor:

To create a new remote content crawler:

Open an administrative folder.
In the Create Object menu, click Content Crawler - Remote.
In the Choose Web Service dialog box, select the appropriate Web service and click OK.

To edit an existing remote content crawler:

Navigate to the remote content crawler you want to edit.
Click the remote content crawler name.