
Ab Initio ETL Tool Tutorial for Beginners PDF 26: A Comprehensive Overview of Ab Initio Architecture



Ab Initio, also written Abinitio, is a tool used to extract, transform and load (ETL) data. 'Ab initio' is a Latin phrase that means 'from the beginning'. The company was given this name because Sheryl Handler and her team founded it after the bankruptcy of their previous company. Sheryl Handler was the former CEO of Thinking Machines Corporation, and she started this company as a new beginning when Thinking Machines Corporation went bankrupt.


Q. What are local and formal parameters?
Answer: Both are graph-level parameters. A local parameter must be given a value when it is declared, whereas a formal parameter needs no initial value: the graph prompts for it at run time. A local parameter is like a local variable in C, whereas a formal parameter is like a command-line argument that must be passed at run time.

Q. What are Broadcast and Replicate?
Answer: Broadcast can do everything that Replicate does. Broadcast can also send a single file to a multifile system (MFS) without splitting it, making multiple copies of that single file in the MFS. Replicate combines the data it receives in arbitrary order into a single flow and writes a copy of that flow to each output flow. Replicate generates multiple straight flows as output, whereas Broadcast produces a single fan-out flow. Replicate improves component parallelism, whereas Broadcast improves data parallelism.

Broadcast: Takes data from multiple inputs, combines it, and sends it to all output ports. For example, suppose a Broadcast component has 2 incoming flows (from either data parallelism or component parallelism), one with 10 records and the other with 20 records. Then every outgoing flow (there can be any number of them) will have 10 + 20 = 30 records.

Replicate: Copies the data of each partition to every output port of the component while maintaining partition integrity. For example, suppose the flow into Replicate has a data parallelism level of 2, with one partition holding 10 records and the other holding 20 records, and Replicate has 3 output flows. Then each output flow will have 2 data partitions, with 10 and 20 records respectively.

Q. What is the importance of the EME in Ab Initio?
Answer: The EME (Enterprise Meta>Environment) is a repository in Ab Initio. It is used to check graphs in and out, and it also maintains graph versions.

Q. What is m_dump?
Answer: It is a Co>Operating System command used to view data from the command prompt.
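
The record counts in the Broadcast and Replicate examples above can be checked with a small sketch. This is a toy model in plain Python, not Ab Initio code: the function names `broadcast` and `replicate` and the list-of-lists representation of flows and partitions are assumptions made purely for illustration.

```python
# Toy model of the Broadcast / Replicate record counts described above.
# Flows and partitions are modelled as Python lists; this is NOT Ab Initio code.

def broadcast(input_flows, n_outputs):
    """Combine the records of all inputs and send the full set to every output."""
    combined = [rec for flow in input_flows for rec in flow]
    return [list(combined) for _ in range(n_outputs)]

def replicate(partitions, n_outputs):
    """Copy the partitioned flow to every output, keeping each partition separate."""
    return [[list(part) for part in partitions] for _ in range(n_outputs)]

# Two inputs / partitions with 10 and 20 records, as in the example.
flow_a = [f"a{i}" for i in range(10)]
flow_b = [f"b{i}" for i in range(20)]

broadcast_out = broadcast([flow_a, flow_b], n_outputs=3)
replicate_out = replicate([flow_a, flow_b], n_outputs=3)

broadcast_counts = [len(out) for out in broadcast_out]
replicate_counts = [[len(part) for part in out] for out in replicate_out]

print(broadcast_counts)  # [30, 30, 30]
print(replicate_counts)  # [[10, 20], [10, 20], [10, 20]]
```

In this model every Broadcast output carries all 30 records, while every Replicate output keeps the original two partitions of 10 and 20 records.
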
The m_dump command prints the data in a formatted way.

Q. What is the syntax of the m_dump command?
Answer: m_dump

Q. What are the differences between the different GDE versions (1.10, 1.11, 1.12, 1.13 and 1.15)?
Answer: A related question is what the differences are between the different versions of the Co>Operating System. Version 1.10 is a non-key version and the rest are key versions. Many components were added and revised in the later versions.

Q. How do you run a graph without the GDE?
Answer: A graph can be deployed as a .ksh file in the run directory. This .ksh file can then be run from the command prompt with ksh.

Q. What is the difference between a DML expression and an XFR expression?
Answer: A DML expression refers to Ab Initio DML that is stored in a file. DML describes data in terms of expressions that perform simple computations, transform functions that control data transforms, and keys that specify grouping or non-grouping. In other words, DML files are non-embedded record format files. XFR files are, simply put, non-embedded transform files. A transform function expresses business rules, local variables and statements, as well as the connections between these elements and the input and output fields.

Q. How does MAXCORE work?
Answer: Maxcore is temporary memory used, for example, to sort records. It is a value (in KB) that sets the maximum memory a component may use during its execution. Whenever the component is executed, it can take up to the amount of memory specified.

Q. What is $mpjret? Where is it used in Ab Initio?
Answer: $mpjret is the return value of the shell command "mp run" that executes an Ab Initio graph. It is generally treated as the return status of the graph execution.

Q. What is the latest version available in Ab Initio?
Answer: The latest version of the GDE is 1.15 and the latest version of the Co>Operating System is 2.14.

Q. What is meant by Co>Operating System, and why is it special for Ab Initio?
Answer: The name "Co>Operating System" says a lot: it is not merely an engine or an interpreter. As the name suggests, it is an operating system that co-exists with another operating system. In layman's terms, Ab Initio, unlike other applications, does not sit as a layer on top of the OS. It has many operating-system-level capabilities of its own, such as multifiles and memory management, and in this way it integrates fully with the host OS and works jointly with it on the available hardware resources. This synergy with the OS optimizes the use of the available hardware. Unlike other applications (including most other ETL tools), it does not work as a layer that interprets commands. That is the major difference from other ETL tools. This is why Ab Initio is regarded as much faster than other ETL tools, and also as much costlier.

Q. How do you take input data from an Excel sheet?
Answer: There is a Read Excel component that reads the Excel file either from the host or from a local drive. The DML will be a default one. Through the Read Excel component in $AB_HOME we can read Excel files directly.

Q. How will you test a .dbc file from the command prompt?
Answer: You can test a .dbc file from the (Unix) command prompt using the m_db test command, which checks the database connection and reports the database version and user.

Q. Which is faster to process, fixed-length DMLs or delimited DMLs, and why?
Answer: Fixed-length DMLs are faster because data of the given length is read directly without any comparisons, whereas with delimited DMLs every character has to be compared against the delimiter, which causes delays.

Q. What are the continuous components in Ab Initio?
Answer: Continuous components are used to create graphs that produce useful output while running continuously. Examples: Continuous Rollup, Continuous Update, Batch Subscribe.

Q. How can I calculate the total memory requirement of a graph?
Answer:
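
The fixed-length versus delimited answer above can be illustrated with a minimal sketch. It is plain Python and says nothing about how Ab Initio reads records internally: the function names, the field widths, and the `|` and newline separators are all assumptions chosen for illustration. The point is only that the fixed-length reader slices at known offsets, while the delimited reader has to compare every byte with the separators.

```python
# Illustrative only: two ways to split the same two records into fields.
# Widths, separators and function names are hypothetical, not Ab Initio APIs.

def parse_fixed(data: bytes, widths):
    """Slice fields at precomputed offsets; no byte is compared with anything."""
    record_len = sum(widths)
    records = []
    for start in range(0, len(data), record_len):
        fields, pos = [], start
        for width in widths:
            fields.append(data[pos:pos + width])
            pos += width
        records.append(fields)
    return records

def parse_delimited(data: bytes, field_sep=b"|", record_sep=b"\n"):
    """Scan byte by byte, comparing each one with the field and record separators."""
    records, fields, current = [], [], bytearray()
    for i in range(len(data)):
        ch = data[i:i + 1]
        if ch == field_sep:
            fields.append(bytes(current))
            current = bytearray()
        elif ch == record_sep:
            fields.append(bytes(current))
            current = bytearray()
            records.append(fields)
            fields = []
        else:
            current += ch
    if current or fields:  # final record without a trailing record separator
        fields.append(bytes(current))
        records.append(fields)
    return records

fixed_records = parse_fixed(b"001Ann002Bob", widths=(3, 3))
delimited_records = parse_delimited(b"001|Ann\n002|Bob\n")

print(fixed_records == delimited_records)  # True
```

Both functions return the same fields; the difference is that `parse_delimited` performs a comparison for every input byte, which is the extra work the answer refers to.
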







Broaden, deepen and polish your knowledge as a data manager or consultant. Use this online guide as a training tool for yourself and your project team. Work through all the relevant definitions and jargon and get inspired by the valuable ideas in the online video library.

