Data Federation Glossary

Terms defined in this glossary appear in help topics in italics, like this: grid object archiving.

access control list

(ACL) A list of users and groups and the permissions (read, write, execute, delete, and owner) that control whether the users and groups can view, modify, invoke, and remove an object and edit the object’s ACL. Every object in the data catalog has an ACL.

ACL

See access control list.

ad hoc query

A mechanism that lets you directly query a database in SQL. The query must run through an existing database connector. You can run an ad hoc query using either the CLI or a JDBC driver. Ad hoc queries can be thought of as single-use database operations.

attribute

A property of a Data Federation directory, file, service, or other object. Every object in the data catalog has attributes. Each attribute has a name, a type (string, integer, float, date, time, or timestamp), and a value. System attributes are read-only; you can change the values of other attributes. You can also create new attributes and add them to objects.

authentication service

A service associated with a Data Federation domain that authenticates a Data Federation user’s identity and provides security credentials each time the user logs in. Data Federation can be configured to use third-party directory services as authentication services; for user accounts created in the Data Federation domain, Data Federation uses its own default authentication service.

cache service
A staging service that stores copies of files, database operation results, and data service results. Caching improves retrieval performance. To ensure that an object is stored in the cache, you can pin a file or directory in the data catalog, or schedule a database operation or data service. A cache service can provide remote caching, local caching, or both. The freshness of cached data is controlled by a data expiration interval that determines how long cached data is considered valid and by a cache coherence window that tells the cache service how often to check whether cached data is still valid. If cached data is too old to satisfy a new request (or is not stored in this cache), the cache service does one of the following:
  • If the database operation or data service that produced the data is local to this cache service, the cache service triggers execution of the database operation or data service.
  • If the database operation or data service that produced the data is remote from this cache service, this cache service requests the data from the data source’s local cache service.
A cache service can be associated with a data grid access server, a grid server, or a local user in a CLI session. See also local cache,remote cache,on-demand caching, and scheduled caching.
category

A mechanism for classifying and organizing the contents of the data catalog. Like Data Federation directories, categories serve as containers for objects in the data catalog. Anything in the data catalog—views, data services, shared files, even Data Federation directories themselves—can be assigned to a category. Categories are hierarchical, they have attributes, and access control lists regulate access to them.

CIFS client

A machine that mounts files or directories from the data catalog by connecting to a CIFS share through a data grid access server. A CIFS client need not have Data Federation software installed. (CIFS—Common Internet File System—is a file-sharing protocol based on the file system implemented by Windows.) Compare NFS client.

CIFS share

A directory or file that has been exported (shared) from the data catalog. A CIFS share can be mapped into a Windows file system like a network drive. When you browse the Windows file system, CIFS shares look like—and can be accessed like—other files and directories. CIFS shares are created through a data grid access server. Contrast with Data Federation share.

CLI
Command line interface. You can execute Data Federation commands on any machine where a grid server or command client is installed. The CLI provides most of the functionality of the Web UI and WorkSpace Data Federation.
client

Data Federation supports several types of client: WorkSpace Data Federation, CIFS clients, command clients, JDBC/ODBC clients, NFS clients, Web clients, and WS clients.

command client

A machine that can issue Data Federation CLI commands but does not contribute resources to the Data Federation domain.

connect port

The connect port on a grid domain controller, grid server, data grid access server, proxy server, or share server accesses the JNDI naming service or RMI registry for the underlying application server. The connect port is one of many ports that a GDC or server uses to communicate with other Data Federation objects. You must supply the connect port number of a target grid server or GDC whenever you connect a new object (for example, another server, a copy of Sybase WorkSpace, or a command client) to a Data Federation domain. When you interconnect two Data Federation domains, you must supply each domain’s connect port number to the other one.

connection profile

A configuration set used to access an endpoint. For example, the connection profile for a database endpoint defines the database name, server, user ID, and password parameters that facilitate the connection to the physical database. A server connection profile provides the information to connect directly to a specific server, such as a Data Federation grid domain controller.

custom data service deployer

A tool that lets you create and configure a data service based on your own data service plug-in (written in Java) and publish it in the data catalog. The deployer is part of WorkSpace Data Federation. (You can also use the Web UI to deploy data services based on your own plug-ins.)

database connector

A mechanism that enables one or more database operations, SQL views, or ad hoc queries to connect to a relational database.

database operation
(DBOP) A mechanism that can:
  • Extract data from a relational database and deliver it on demand to a view generator or a data service, or
  • Modify data in a relational database.
A database operation can be a SQL statement or a stored procedure call. Compare ad hoc query.
data catalog

A hierarchical structure similar to a file system that encompasses all objects in a Data Federation domain. The data catalog contains Data Federation directories and files, Data Federation shares, Data Federation servers, SQL views, database operations, data services, and other objects.

data grid access server

See DGAS.

Data Federation directory

Sybase Data Federation creates a single, unified namespace that is accessible (subject to Data Federation access control lists) to all users in the Data Federation domain. The namespace, called the data catalog, is arranged as a hierarchy of Data Federation directories (folders). The catalog directory structure is stored by the domain’s grid servers and its GDC, while the physical files remain in their original locations in your local file systems. It is important to distinguish between Data Federation directories, which are part of the data catalog, and local directories, which reside in your local file system. Compare with category.

Data Federation domain

The basic administrative unit of the Data Federation EII system. A Data Federation domain consists, at a minimum, of one grid domain controller and may also include one or more grid servers, share servers, proxy servers, data grid access servers, and command clients.

See also domain name.

Data Federation group

A set of users who have the same permissions on one or more Data Federation objects. You can use the group name in place of a user name when you set permissions or create access control lists.

Data Federation installation directory

The directory in your local file system where Data Federation software is installed. This is not a data catalog directory.

Data Federation server

A service that starts, stops, and monitors other Data Federation services on a particular computer. Every server is part of a Data Federation domain. A server is permanently attached to the computer where it is started. There are several types of server: data grid access servers, grid domain controllers, grid servers, proxy servers , and share servers.

Data Federation share

(Also shared directory.) A pointer in the data catalog to a directory or file in the underlying local file system. When you browse the data catalog, Data Federation shares look like—and can be accessed like—other Data Federation directories. Contrast with CIFS share.

data grid access server

(DGAS) A Data Federation server that makes Data Federation directories and their contents available to CIFS clients and NFS clients.

data service
An operation that transforms data obtained from sources in the data catalog. Input data can come from any number of sources, including:
  • Other data services
  • Data catalog files (which can be generated views)
  • Database operations (which in turn extract the data from relational databases)
  • HTTP requests
  • Web service invocations
You can generate the code that manipulates the data by creating a view model in WorkSpace Data Federation, or by writing a custom data service plug-in using Java, JavaScript, or XSLT. Data service output can be in rowset or XML format. Data services are run by the execution services on grid servers, they can be scheduled, and their results can be cached. See also data service plug-in.
data service plug-in

The logic for a data service, written in Java, JavaScript, or XSLT. Data service plug-ins are modular—you can use the same plug-in for multiple data services. Use the Data Federation Plug-in Wizard to help create data service plug-ins. You can deploy plug-ins as data services using either the Web UI or the custom data service deployer (for Java plug-ins only) in WorkSpace Data Federation.

dependency

A relationship in which a Data Federation object requires input from other Data Federation objects. A data service might require input from one or more database operations or from other data services. A view generator might depend on a database operation for input. A database operation can serve as an input source for one or more data services or view generators. Generated SQL views depend on database operations, virtual database operations, or data services. You can use WorkSpace Data Federation, the Web UI, or the CLI to list input and output dependencies for any data service, database operation, or view.

DGAS

See data grid access server.

distributed transaction
A set of related operations (typically SQL operations such as SELECT, INSERT, UPDATE, DELETE, and CALL) that:
  • Involve one or more databases, and
  • Might lead to unwanted results (such as leaving participating databases in an inconsistent state or producing inconsistent reads) if some of the operations complete and others do not, and that therefore
  • Must all be executed at once, as a single transaction.
The individual operations that make up a distributed transaction are performed by database operations that use database connectors configured with XA-capable JDBC drivers; all the database operations are executed (using the two-phase commit protocol) by a specially configured data service. The two-phase commit protocol is designed to ensure that the participating databases will be left in a consistent state—that is, all the operations in the distributed transaction will be completed, or none of them will.
domain name

A unique alphanumeric identifier for a Data Federation domain. The domain name is assigned by the Data Federation administrator when the Data Federation domain is initialized. The domain name has a maximum length of 30 characters.

enterprise information integration
(EII) A software system that:
  • Enables applications and users to access, without replication, both raw and integrated data from multiple heterogeneous distributed data sources while hiding the complexity of the data sources, and
  • Provides tools enabling users and data owners to further integrate and transform data.
exclusion

See schedule exclusion.

execution service

Execution services run data services. There is an execution service on every grid server, and you can configure a pool of execution services for load-sharing. When a pool is in place, a data service can be run by any execution service in its grid server’s pool.

failover

The transition of control from a failing or unreachable primary grid domain controller to a secondary grid domain controller.

federated data access

A scheme that allows independently controlled elements to be shared into a single namespace. Files, user accounts, and other objects maintain their separate identities and remain under the control of their owners, but—subject to access controls—the objects can be accessed, managed, and viewed as if they were part of a single system.

GDC

See grid domain controller.

generated view

A file created by a view generator. It may contain data obtained from a database operation, a data service, a file, or an HTTP source. Like other files, generated views exist in a local file system and are shared into the data catalog.

GOA

See grid object archiving.

grid

A heterogeneous group of networked resources that appears and functions as one operating environment. A data grid like the Data Federation Enterprise Information Integration (EII) system provides secure, shared access to data.

grid directory

See Data Federation directory.

grid domain

See Data Federation domain.

grid domain controller

(GDC) The first server in a Data Federation domain is the grid domain controller. The GDC maintains a portion of the data catalog and provides authentication services. It can also run Data Federation commands, share data, and monitor other servers. (That is, the GDC functions as a grid server.) If the domain is configured for failover, it has both a primary GDC and a secondary GDC; the secondary is updated at regular intervals and takes over management of the domain if the primary fails. Any Data Federation shares managed by the primary are read-only on the secondary.

grid object archiving

(GOA) A mechanism for packaging, storing, and redeploying grid objects such as data services, database operations, SQL views, and database connectors. GOA lets you copy services and data sources from one Data Federation domain to another, or make back-up copies of services and data sources that can be restored to their original servers if the need arises.

grid server

A Data Federation server that maintains a portion of the data catalog, runs Data Federation services such as shares, execution services, caches, and searches, and allows you to run Data Federation’s Web UI and execute Data Federation commands.

group

See Data Federation group.

hard link

Provides an alternate name for an item in the data catalog. Changes to the object’s other names have no effect on the hard link: you can move or change a file’s original name and the hard link still knows where to find the file. To delete a hard-linked object, you must remove its original name. Contrast with soft link.

interconnect

To create a unidirectional link from one Data Federation domain to another. Interconnecting lets a Data Federation domain make its data catalog visible to users in another domain (subject to Data Federation access controls).

JDBC driver
Java Database Connectivity drivers allow application programmers to access database data shared in the data catalog. When a JDBC driver accesses data, it returns a JDBC result set that is immediately available to your program. JDBC drivers can:
  • Call any data service in the data catalog
  • Call any database operation in the data catalog
  • Perform SQL select operations against SQL views in the data catalog
Sybase offers three JDBC drivers for use with Data Federation software:
  • The Avaki native JDBC driver, which is included with each grid server
  • jConnect, the Sybase standard JDBC driver
  • An XA-capable driver for use with database connectors that support distributed transactions
link

See hard link and soft link.

local cache

A cache service running on the same grid server as a database operation or a data service that generates cachable data. The local cache stores results produced by local database operations and data services so they do not have to execute for every new request. Compare remote cache.

metadata model

A construct in WorkSpace Data Federation that expresses a schema by defining a set of tables. A table in a metadata model can be mapped (linked) to a Data Federation object such as a data service or a database operation, or to a table in a relational database. The mapping lets you address each mapped object by the name of the corresponding table in the metadata model. You can also derive a view model schema from a metadata model. When you do this, you ensure that the results of any data service deployed from the view model conform to the metadata model’s schema.

NFS client

A machine that mounts the data catalog (or a portion of it) as a directory by connecting to a data grid access server. An NFS client need not have Data Federation software installed. (NFS—Network File System—allows you to add file systems located on a remote computer to the directory structure on your own computer.) Compare CIFS client.

ODBC

Open DataBase Connectivity is an API for databases on Windows. The ODBC driver included with Sybase Adaptive Server® Enterprise allows Data Federation to communicate with Windows database applications.

on-demand caching

A scheme by which an object is cached only if it is used—for example, results are cached when a database operation or a data service is executed, or a file is cached when a user or application reads it. On-demand caching uses a fixed expiration interval to determine data freshness. On-demand caching is suitable for objects that are rarely accessed or that change at irregular intervals. Contrast with scheduled caching.

pin

To mark aData Federation directory or file for scheduled caching. See also cache service.

plug-in

See data service plug-in.

primary GDC

See grid domain controller.

proxy server

A Data Federation server that allows Data Federation domains on opposite sides of a firewall or a Network Address Translator (NAT) to communicate with one another.

queries

See ad hoc query.

query engine

A Data Federation service that executes SQL queries against the SQL views (tables) that make up the Data Federation virtual database. A query engine analyzes queries, pushes as much of the work as possible down to the underlying relational database (if there is one), and performs the remaining operations (such as joins across tables from different databases) itself. There is a query engine on each grid server.

remote cache

A cache service running on a grid server that is remote from a Data Federation service (a database operation or a data service) which generates cachable data. The remote cache stores results produced by distant services so the results need not be fetched over the network to satisfy every new request. Users and applications that access remote data through the cache may have access to cached copies even when the remote data source is unavailable. Compare local cache.

scheduled caching

A scheme by which an object in the data catalog is cached according to a schedule that you create. The schedule specifies when the object is first cached and how often (or following what trigger event, such as a change to a file) the cache is refreshed. If the object is a data service or a database operation, the schedule runs it to put fresh results in the cache. Scheduled caching, which overrides other types of caching, is suitable for objects that are updated frequently or on a regular basis. Contrast with on-demand caching.

schedule exclusion

A named period of time during which scheduled activities can be prevented from running. You can apply an exclusion to as many schedules as you want. Scheduled activities include refreshing Data Federation shares and imported user accounts, and caching files, directories, and the results of database operations, data services, and generated views.

secondary GDC

See grid domain controller.

service

A Data Federation object that performs a function in the domain (stores data or authenticates users, for example). Services provided in Data Federation software include Data Federation directories, Data Federation shares, Data Federation servers, authentication services, execution services, and user accounts.

share

A point of connection between the data catalog and a native file system or file system tool. Data Federation supports two kinds of shares: Data Federation shares and CIFS shares.

shared directory

See Data Federation share.

share server

A Data Federation server whose only task is to manage Data Federation shares—local directories that are exported (shared) into the data catalog. (Grid servers can also manage shares.)

soft link

A pointer to a particular location (name) in the data catalog. If the object at that location is moved, deleted, or renamed, the soft link leads nowhere. Soft links can be created only in the CLI. Contrast with hard link.

SQL view
A virtual table—a data catalog entry that represents a table in a relational database, a database operation, or a data service. SQL views can be:
  • Provisioned directly from a table in an underlying database
  • Generated from a database operation or data service
  • Mapped from a database table, a database operation, or a data service, using the WorkSpace Data Federation metadata model editor
Every SQL view is part of the Data Federation virtual database. SQL views are treated as relational tables by the Data Federation query engine. SQL view data can be accessed using standard SQL statements by connecting to Data Federation with ODBC or JDBC, or via a Data Federation virtual database operation. See update notifications.
transactional

A unit of work that consists of several operations is transactional if the operations are treated as a single transaction. If all the operations cannot be completed, the entire transaction is rolled back. A transactional data service executes it own actions and all operations performed by its inputs (database operations and other data services, for example) as a single transaction. Data Federation supports XA-based distributed transactions and non-XA (nondistributed) transactions.

update notification

A message issued when a generated view is updated. A view that receives data from another view can be configured to regenerate itself (using the new data) upon receipt of an update notification. See generated view, SQL view, view generator, view model.

view

See generated view, SQL view,view generator, and view model.

view generator

A mechanism that does one of the following: extracts data from a file or an HTTP source, obtains data from a data service, or uses a database operation to extract data from a relational database. The view generator can display the data, perform an XSLT transform, save the data as a generated view file, and update a database. Contrast with data service.

view model

The graphical representation of a data flow that you can build in WorkSpace Data Federation. A view model typically includes one or more input sources (such as database operations or data services), one or more operators to combine or transform the data, and an output target. When you deploy a view model, it becomes a Data Federation data service.

See virtual database.
virtual database

The set of all SQL views in a Data Federation domain, including those provisioned from external databases and those generated from data services and database operations. You can execute SQL queries on the SQL views in the virtual database as if they were tables in a single database.

virtual database operation

A database operation whose source database is the Data Federation virtual database itself. Use virtual database operations to encapsulate and reuse SQL SELECT queries against SQL views (provisioned or generated).

Web services client

(Also WS client.) A tool or a piece of code that is part of a customer application and that makes SOAP calls to Web services on a grid server. The SOAP calls can request data from files in the data catalog, from a database operation, or from a data service.

Web UI

Web user interface. You can connect to Sybase Data Federation’s graphical Web UI by pointing a Web browser at a GDC or grid server. The Web UI provides most of the functionality of the CLI and WorkSpace Data Federation.

WorkSpace Data Federation
A graphical, metadata-based data integration tool that lets you:
  • Build data flows by dragging and dropping input sources, operators, and output targets. You can deploy your data flows as data services.
  • Import or create metadata models and apply them to Data Federation objects or use them to build new data services.
  • Store grid objects such as data services and database operations in archives, from which you can later deploy the objects.
In addition, you can use WorkSpace Data Federation to perform provisioning tasks (creating database connectors, database operations, virtual database operations, SQL views, and Data Federation shares), manipulate categories, and edit access control lists and attributes.
WS client

See Web services client.

For product-related issues, contact Sybase Technical Support at 1-800-8SYBASE. Send your feedback on this help topic directly to Sybase Technical Publications: pubs@sybase.com