| 1.WWWGrab |
|
29/Jun/2007 |
|
|
WWWGrab is a Windows program that scans lists of web pages, fetching the data at each URL (Uniform Resource Locator) and parsing the data with the Set Machine data transformer.
|
|
|
| 2.Set Machine |
|
01/Jul/2007 |
|
|
Set Machine is a highly configurable pattern recognition and manipulation utility for emails and files. It allows the user to define unlimited combinations and sequences of patterns, and actions to perform when patterns are recognized. It accepts input from stored emails (via MAPI) and/or files. It can generate database tables (via ODBC) and/or files. It can be configured to parse emails (header fields, text bodies) or perform transformations on files.
Set Machine can perform a wide variety of tasks because its design recognizes that many transformation tasks (parsing/extraction/conversion/searches etc.) involve the same basic repetitive process:
- recognition of patterns in the input,
- transition to another "state" based on recognition of the next pattern in the input.
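The two-step process above can be sketched as a small table-driven state machine. The Python below only illustrates the general idea and is not Set Machine's configuration format or internals: the state names, the `TRANSITIONS` table and the `run` function are all hypothetical.

```python
import re

# Hypothetical transition table: state -> list of (pattern, action, next_state).
# None of these names come from Set Machine itself.
TRANSITIONS = {
    "text": [
        (re.compile(r"<!--"), "skip", "comment"),
        (re.compile(r"[A-Za-z]+"), "emit", "text"),
    ],
    "comment": [
        (re.compile(r"-->"), "skip", "text"),
    ],
}


def run(data, state="text"):
    """Scan data, emitting words that appear outside <!-- ... --> comments."""
    out = []
    pos = 0
    while pos < len(data):
        for pattern, action, next_state in TRANSITIONS[state]:
            match = pattern.match(data, pos)
            if match:
                # A pattern was recognized: perform its action, then
                # move to the state that the table names for it.
                if action == "emit":
                    out.append(match.group())
                pos = match.end()
                state = next_state
                break
        else:
            # No pattern recognized here: skip one character, keep the state.
            pos += 1
    return out


print(run("alpha <!-- hidden words --> beta"))
```

Changing the task means changing the table, not the loop, which is one way to picture a tool whose behaviour is defined by the user.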
Internally, Set Machine is very general and abstract. The user defines the details of the transformation task. As a result, Set Machine is both flexible and challenging.
Features:
* Recursive capabilities (enabling parsing of nested HTML/XML tags, comments, etc.)
* Wide-string (Unicode) input / output capability
* ODBC interface making database layout info (table and field names) available to the configuration developer
* ODBC interface allowing generation of arbitrary SQL statements built with a combination of user-defined data and parsed data
* User-defined function interface allowing execution of custom DLL code
Set Machine can be configured to:
* Parse emails
* Search for (and replace) text
* Generate C/C++ code, HTML, XML, and other formats from various sources (emails, C/C++ code, HTML, XML, etc.)
* Parse C/C++ source code
* Generate and execute SQL
* Count words/keywords
* Count lines
* Swap bytes
Set Machine can be configured to perform a practically limitless variety of tasks.
|
|
|
| 3.DTBuild |
|
16/Mar/2009 |
|
|
DTBuild is a configurable and highly flexible pattern recognition / extraction / manipulation utility (parser) for emails and files. It allows the user to define patterns, combine them in any sequence, and perform actions when the patterns are recognized. It accepts input from stored emails (via MAPI) and/or files. It can generate database tables (via ODBC) and/or files. It can be configured to parse emails (header fields, text bodies) or perform transformations on files.
DTBuild can perform a wide variety of tasks because its design recognizes that many transformation tasks (parsing/extraction/conversion/searches etc.) involve the same basic repetitive process:
- recognition of patterns in the input,
- transition to another "state" based on recognition of the next pattern in the input.
Internally, DTBuild is very general and abstract. The user defines the details of the transformation task. As a result, DTBuild is very flexible, but can be challenging.
Features:
* Recursive capabilities (enabling parsing of nested HTML/XML tags, comments, etc.)
* Wide-string (Unicode) input / output capability
* Stored email (MAPI) interface
* ODBC interface making database layout info (table and field names) available to the configuration developer
* ODBC interface allowing generation of arbitrary SQL statements built with a combination of user-defined data and parsed data
* User-defined function interface allowing execution of custom DLL code
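To show why recursion matters for nested markup, here is a minimal Python sketch of a recursive parser for nested tags. It is not DTBuild code: the `TOKEN` pattern and the `parse` and `parse_markup` functions are hypothetical, and the sketch assumes bare tags without attributes.

```python
import re

# Hypothetical tokenizer: an opening or closing tag, or a run of text.
TOKEN = re.compile(r"<(/?)([A-Za-z][A-Za-z0-9]*)>|([^<]+)")


def parse(tokens, closing=None):
    """Build a nested list of (tag, children) tuples and text strings."""
    children = []
    for is_close, name, text in tokens:
        if text:
            if text.strip():
                children.append(text.strip())
        elif is_close:
            if name != closing:
                raise ValueError(f"unexpected </{name}>")
            return children
        else:
            # Opening tag: recurse to collect everything up to its close.
            children.append((name, parse(tokens, closing=name)))
    if closing is not None:
        raise ValueError(f"missing </{closing}>")
    return children


def parse_markup(markup):
    """Tokenize markup and parse it into a nested structure."""
    # One shared generator, so each recursive call resumes where
    # its caller stopped reading.
    tokens = (m.groups() for m in TOKEN.finditer(markup))
    return parse(tokens)


print(parse_markup("<a>hi<b>x</b></a>"))
```

Each nesting level is handled by one call to `parse`, so the nesting depth does not have to be known in advance.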
DTBuild can be configured to:
* Parse emails
* Search for (and replace) text
* Repair data
* Generate C/C++ code, HTML, XML, and other formats from various sources (emails, C/C++ code, HTML, XML, etc.)
* Parse C/C++ source code
* Generate and execute SQL
* Count words/keywords
* Count lines
* Swap bytes
DTBuild can be configured to perform a practically limitless variety of tasks.
|
|
|
| 4.DTRun |
|
19/Mar/2009 |
|
|
DTRun runs data transformations created with DTBuild. DTRun is an affordable data transformation tool for users who don't require the full configuration capabilities of DTBuild. DTRun can scan and recognize data in files and emails, and output data to files and databases.
DTRun allows the user to specify most of the parameters used by a data transformation:
* lists of input files / message folders
* lists of output files / database destinations
* lists of search text items, and replacement text when applicable
DTRun does not provide advanced configuration capabilities such as node creation - for that you need DTBuild.
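The three parameter lists can be pictured with a short Python sketch. This is not DTRun's interface: the function names are hypothetical, and pairing each input file with exactly one output file is an assumption made to keep the example small.

```python
from pathlib import Path


def apply_replacements(text, replacements):
    """Apply each (search, replacement) pair in order and count the hits."""
    hits = 0
    for search, replacement in replacements:
        hits += text.count(search)
        text = text.replace(search, replacement)
    return text, hits


def transform_files(inputs, outputs, replacements):
    """Read each input file, apply the replacements, write the paired output."""
    total = 0
    for src, dst in zip(inputs, outputs):
        text = Path(src).read_text(encoding="utf-8")
        new_text, hits = apply_replacements(text, replacements)
        Path(dst).write_text(new_text, encoding="utf-8")
        total += hits
    return total


print(apply_replacements("colour and colour", [("colour", "color")]))
```

The transformation logic stays fixed while the file lists and search items vary per run, which mirrors the split between defining a transformation and supplying its parameters.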
DTRun features:
* Wide-string (Unicode) input / output capability
* MAPI interface (for scanning stored emails)
* ODBC interface (for database creation and manipulation)
DTRun can be used to:
* Parse emails
* Search for (and replace) text in files
* Filter data
* Repair data
* Parse C/C++ and other programming languages
* Generate C/C++ code, HTML, XML, and other formats from various sources (emails, HTML, XML, RTF, C/C++ code, etc.)
* Generate and execute SQL
* Count recognized input patterns: words, specific keywords, line breaks, etc.
... In conjunction with DTBuild, DTRun can be used to perform a practically limitless variety of tasks.
|
|
|
| 5.DGOTool |
|
12/Nov/2009 |
|
|
DGOTool is a cost-effective, multi-purpose search and transformation tool for emails, files and web pages. It extracts specified patterns and transmits them, and other data, to databases and files. It can search emails, files and web pages; gather data into files and databases; convert formats; filter data; repair data; and count recognized items (keywords, etc.).
DGOTool runs data transformations created with DGOTool D. It also allows the user to specify most of the transformation parameters:
* lists of input files / email folders / URLs
* lists of output files / database destinations
* lists of search text items, and replacement text when applicable
... plus other parameters.
DGOTool does not provide advanced configuration capabilities such as node creation. Full configuration capability is provided by DGOTool D.
DGOTool features:
* HTTP interface (for getting Internet data)
* MAPI interface (for scanning stored emails)
* ODBC interface (for database creation and manipulation)
* Wide-string (Unicode) input / output capability
DGOTool can be used to:
* Parse emails
* Scrape the web
* Search for (and replace) text in files
* Filter unwanted data
* Repair data
* Parse C/C++ and other programming languages
* Generate C/C++ code, HTML, XML, and other formats from various sources (emails, HTML, XML, RTF, C/C++ code, etc.)
* Generate and execute SQL
* Count recognized patterns in the input stream: words, specific keywords, line breaks, etc.
... DGOTool, in conjunction with DGOTool D, can be used to perform a practically unlimited variety of tasks.
|
|
|
| 6.DGOTool D |
|
12/Nov/2009 |
|
|
DGOTool D is a multi-purpose search and transformation development workshop for files, emails (via MAPI) and web pages. The user defines patterns to recognize, links between patterns, and actions to perform with recognized patterns and other data. DGOTool D uses ODBC to display database structure and transmit SQL; it produces file output as well. Use it to develop file searchers and transformers, email parsers, web scrapers, and more.
DGOTool D can perform a wide variety of tasks because its design recognizes that many transformation tasks (searching/parsing/extraction/conversion/reformatting etc.) involve the same basic repetitive process:
- recognition of a pattern in the input,
- transition to another "state" based on recognition of the next pattern in the input.
The user defines the details of the transformation task. As a result, DGOTool D is very flexible. Existing definitions (DXD files) can simplify development.
Features:
* Recursive capabilities (enabling parsing of nested HTML/XML tags, comments, etc.)
* HTTP interface (for getting Internet data)
* Stored email (MAPI) interface
* ODBC interface that displays database layout info (table and field names) to the user
* ODBC interface allowing construction of SQL statements with a combination of user-defined data and recognized data
* Wide-string (Unicode) input / output capability
* Trace mode to show correspondence between input and nodes (for debugging)
* User-defined function interface allowing execution of custom DLL code
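As a rough picture of SQL built from user-defined data combined with recognized data, the Python sketch below stores recognized email header fields in a table. It is not DGOTool D code. Python's built-in `sqlite3` module stands in for an ODBC data source so the example is self-contained, and the `HEADER` pattern, the `headers` table and the `store_headers` function are hypothetical.

```python
import re
import sqlite3

# Hypothetical pattern for one header line such as "Subject: Hello".
HEADER = re.compile(r"^(From|To|Subject):[ \t]*(.*)$", re.MULTILINE)


def store_headers(conn, source_name, message):
    """Insert one row per recognized header; return the number of rows added."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS headers (source TEXT, field TEXT, value TEXT)"
    )
    # source_name is user-defined data; field and value are recognized data.
    rows = [
        (source_name, m.group(1), m.group(2).strip())
        for m in HEADER.finditer(message)
    ]
    # Placeholders keep recognized text out of the SQL string itself.
    conn.executemany("INSERT INTO headers VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")  # stand-in for an ODBC connection
message = "From: ann@example.com\nSubject: Invoice 42\n\nBody text"
store_headers(conn, "inbox", message)
print(conn.execute("SELECT field, value FROM headers").fetchall())
```

Passing recognized text as parameters rather than splicing it into the statement is a design choice of this sketch; it avoids malformed SQL when the input contains quotes.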
DGOTool D can be configured to:
* Parse emails
* Search for (and replace) text in files
* Filter data
* Repair data
* Parse C/C++ and other programming languages
* Generate C/C++ code, HTML, XML, RTF, and other formats from various sources (emails, C/C++ code, HTML, XML, RTF, etc.)
* Generate and execute SQL
* Count recognized input patterns: words, specific keywords, line breaks, etc.
... it can be configured to perform a practically limitless variety of tasks.
|
|
|