Hello All,
I need below information about Jaspersoft ETL tool for data integration and ETL needs:
Development effort:
The development effort , time and complexity is more in general?
Maintainability:
Is it less maintainable?
Error Handling:
Only possesses a single log file? or possesses a log and error port in every transform?
What kind of errors can be handled?
Various teams needed:
Separate Administration team or Unix or NT Admin will suffice needed works. hence it does not need a dedicated administer?
File Structure:
Only able to read record with single type of delimiter?
Data Integration Capability:
ODI boasts comparatively lesser range of Data Integration Products and capability which includes many related functions such as profiling and data quality ? Also, if it offers these capabilities then these are more mainstream in nature?
Market Segments:
Serves medium to large scale companies?
Debugging:
Is it offer easy debugging? Example -just place some watchers on required places and intermediate data will be saved in temporary files for easy viewing. or complex debugging process through debugger?
Company Strategy:
You can download a scaled down free version of their software and plenty of free documents available on internet?
Go live rate:
High “GO Live” success? any know issue during deployment?
Scalability:
Is there any issue with stability? If yes then why is the issue and what is impact?
Which kind of scalability is supported- horizontal, vertical?
Performance:
Can it supports High volume of data movement, transformation and integration (ETL operations)?
How about parallelism - mapping level parallelism, session level parallelism, supports multiple parallel source and multiple target data loads?
Heterogeneous system:
It integrates data from various heterogeneous systems like multiple variety of databases (SQL server, Oracle, DB2 etc), files (XML, XLS, CSV, text etc)?
Targets can be any type of DB , file etc.?
Big Data support:
It can be integrated and used for Big Data?
On cloud solution:
It is available for both- on cloud and on premises platforms?
Pricing:
Is it free ware - open source? Does it come in basic, standard and enterprise editions flavors? If yes , all flavors are free?
Repository:
Does it offers repositories ? Those repositories are for metadata?
Host for repositires should be relational database?
Push down mechanism:
Do we have pushdown optimization concepts, where it can generate SQL statements from the workflow/mapping which can be directly executed on database?
It is ETL or ELT tool?
Job scheduling:
Does it come with in-built scheduler?
Version controlling:
Does it offer version controlling?
If yes then it is tightly controlled or moderate?
Tool Bugs:
Any known tool bugs? Any issue due to those bugs?
Anything else you want to highlight?
Thanks,
Rajneesh
1 Answer:
For starters, Jaspersoft ETL is an OEM of the Talend ETL product. Licensing through Tibco allows the outputs of the ETL process to be only used by Jaspersoft products (JasperReports Server, JasperReports IO, Jaspersoft Studio)
Development effort: Depends on the complexity of your transformation needs.
Maintainability: It can integrate with numerous source control tools.
Error Handling: You have complete control over error handling within the ETL jobs.
Various teams needed: It does not need a dedicated administrator.
File Structure: Support a wide variety of data sources, both as inputs and outputs, including a variety of delimited files.
Only able to read record with single type of delimiter?
Data Integration Capability: A wide varity of a=outputs supported, as mentioned.
Market Segments: Serves medium to large scale companies? Yes
Debugging: Is it offer easy debugging? Fully integrated debugging?
Company Strategy: Check Talend site for free verrsion.
Go live rate: Extensively used. No known isssues.
Scalability: Highly scalable, both horizontally and vertically.
Performance:
Can it supports High volume of data movement, transformation and integration (ETL operations)? Yes
How about parallelism - mapping level parallelism, session level parallelism, supports multiple parallel source and multiple target data loads? Yes
Heterogeneous system:
It integrates data from various heterogeneous systems like multiple variety of databases (SQL server, Oracle, DB2 etc), files (XML, XLS, CSV, text etc)? Yes
Targets can be any type of DB , file etc.? Yes
Big Data support:
It can be integrated and used for Big Data? Yes
On cloud solution:
It is available for both- on cloud and on premises platforms? Yes.
Pricing:
Is it free ware - open source? Does it come in basic, standard and enterprise editions flavors? If yes , all flavors are free? Not all free. Basic job creation is in the community edition, commercial edition required for advanced features such as scheduling, Big Data, etc.
Repository:
Does it offers repositories ? Those repositories are for metadata? Yes
Host for repositires should be relational database? Yes
Push down mechanism:
Do we have pushdown optimization concepts, where it can generate SQL statements from the workflow/mapping which can be directly executed on database? Yes
It is ETL or ELT tool? Both
Job scheduling:
Does it come with in-built scheduler? Yes,
Version controlling:
Does it offer version controlling? Yes
If yes then it is tightly controlled or moderate? Your choice
Tool Bugs:
Any known tool bugs? Any issue due to those bugs? Not as afar as I know - see Talend for more details.
Anything else you want to highlight? Jaspersoft ETL outputs are for consupomption by Jaspersodft tools only.