Fundamentals: Artificial Neural Networks are to deep learning, what atoms are to matter

Blog

Home Blogs Fundamentals: Artificial Neural Networks are to deep learning, what atoms are to matter

February 26, 2020

by Tavish Aggarwal

0 Shares

The human brain is, undoubtedly, the most extraordinary technology created by nature. An artificial neural network (ANN), modeled after the human brain, is a system software or hardware that performs tasks as the neurons of a human brain would, to an extent, by transferring information along a predefined path between neurons. ANNs consist of layers of interconnected neurons that receive sets of inputs and weights. They then perform mathematical manipulation and give out a set of activations as an output that is similar to synapses in biological neurons.

Today, neural networks can be used to achieve complex tasks in the fields of machine learning and artificial intelligence (AI), including object recognition, automatic speech recognition (ASR), machine translation, image captioning, video classification, and more.

Even this is a limited list. Artificial neural networks are the most potent learning models in the field of machine learning. They can achieve arguably every task that the human brain can perform, albeit they might work differently than an actual human brain.

Human Brain and Neural Networks – A Comparison

At a high level, neural networks – both artificial and biological – consist of four components:

Neurons
Topology – the connecting path between neurons
Weights
A learning algorithm

There are substantial differences in each of these components when it comes to biological and artificial neural networks. Some of the key differences are as follows:

An ANN typically consists of hundreds to at best thousands of neurons, whereas the biological neural network consists of billions of neurons.
BNNs have trillions of adjustable parameters, whereas even the most complicated ANNs only have several million learnable parameters.
ANNs primarily use some gradient descent models for learning, whereas when it comes to biological neural networks, even leading neuroscience and cognitive science experts do not have much clarity on the learning methods used.
The processing speed for biological neurons is usually in milliseconds, whereas standard ANNs can process information much faster (in nanoseconds).
Natural neural networks have extremely complicated topologies, whereas artificial neural networks have standard and comparatively simple paths.
Biological neural networks consume significantly lesser power as compared to artificial networks.

How do Neural Networks Work?Essentially, a neuron is a node with several inputs and one output, and many interconnected neurons form a neural network. For neural networks to perform their tasks, they need to go through a 'learning phase' – which means they need to learn to correlate incoming and outgoing signals. Once done, they begin to work, i.e., receive input data and generate output signals based on the accumulated information.

Biological neurons receive signals through dendrites, which either amplify or inhibit the signals as they pass through the axons to the dendrites of other neurons. Similarly, the ANNs learn to inhibit or amplify the input signals to perform a specific task.

Neural Networks and Deep Learning

More often than not, deep learning developers take into account the features of the human brain— the architecture of its neural networks, learning and memory processes and so on – for their deep learning projects which usually need a massive amount of data to train the system to classify signals clearly and accurately.

In this post, we will try to delve into the basics of neural networks and how they work in the field of deep learning.

Perceptrons – Earliest Neural Networks

One of the first neural networks to be invented was the perceptron. The perceptron was an elementary neural network with only one neuron and the Heaviside function (a unit step function whose value is zero for negative arguments and one for positive arguments) as a non-linearity.
Suppose you want to plan a family trip. From the following factors which are the critical factors for you?

Trip Expenditure
Trip Duration
Hotel Comfort and Cuisine Preference
Mode of Travelling

How our human brain works is it assigns weight to each of the factors. For one, trip duration may be the most critical factor, while for someone else it might be the trip expenditure. A perceptron works similarly. It takes input signals as inputs and performs a set of simple calculations to arrive at a decision.

Perceptrons can be used in classification tasks as well, where we fit a divider such that it divides the data points into regions. Perceptron can also perform multiclass classification as well. We can use multiple perceptrons where each perceptron classifies data points into various categories.

Training of Perceptron

Now that we know fundamentally what a perceptron is, let's look at the iterative solution for training the perceptron suggested by Frank Rosenblatt, a notable American psychologist in the field of artificial intelligence. Rosenblatt suggested an elegant iterative solution to train the perceptron (i.e., to learn the weights).

He proposed that we start with random weight and keep on adding the error term to the weight till the time we didn't find the valid separator. The error term is the misclassified point from the previous separator.

Working of Neurons

Neurons are similar to perceptrons; the only difference being that there is an activation function applied to the weighted sum of inputs.

In perceptrons, the activation function is the step function, though, in artificial neural networks, it can be any non-linear function. Few fundamental properties of neural networks are:

Neurons in a neural network are arranged in layers where the first and the last layer are called the input and output layers.
Input layers have as many neurons as the number of attributes in the data set.
The output layer has as many neurons as the number of classes of the target variable in case of the classification problem.
The output layer has one neuron in case of the regression problem.

Generally, the industry used neural networks follow the below-mentioned assumptions:

Neurons are arranged in layers, and the layers are arranged sequentially.
Neurons within the same layer do not interact with each other.
All inputs enter the network through the input layer, and all outputs leave the network through the output layer.
Neurons in consecutive layers are densely connected, i.e., all neurons in layer l are connected to all neurons in layer l + 1.
Every interconnection in a neural network has a weight associated with it, and every neuron has a bias associated with it.
All neurons in a particular layer use the same activation function.

It is important to note that the input to a neural network can only be numeric. So how to solve the problem where the input that we have is text (NLP problems) or images (computer vision problems)?

Text Data as Input: In the case of text data, we either use a one-hot vector or word embeddings corresponding to a particular word. If we need to work with a vast vocabulary, then it is recommended to use word embeddings over one-hot vectors.
Images as Input: In the case of images (or videos), it is quite straightforward since images are naturally represented as pixels (arrays of numbers), where each pixel of the input image is a feature. If we have a grayscale image of size 18 x 18, the input layer would need 324 neurons. If the image is a colored image, we need 18 x 18 x 3 neurons in the input layer, as the color image needs three channels (red, green, and blue).

Activation Functions

We know that the weighted sum of neuron passes through activation function before it goes as an input to neuron in the next layer. The activation function could be any function, though it should have some important properties such as:

Smoothness i.e., they should have no abrupt changes when plotted because decision making doesn't change abruptly based on any factor.
They should also make the inputs and outputs non-linear with respect to each other to some extent. This is because non-linearity helps in making neural networks more compact.

Few popular activation functions are:

Logistic function
Hyperbolic tangent function
Rectilinear Unit (ReLU)

Training of the Neural Network

The weight and bias of every individual neuron need to train to get the right predictions. Training of neural networks is similar to any other machine learning algorithm like SVM, linear regression, where the objective is to find optimal weights and biases to minimize the loss function, which can be complex even for a simple network. For solving real-world problems, there will be an exponentially large number of weights and biases that need to be minimized. Keeping the mentioned complexity in mind. Let's see the steps involved in training the neural networks:

Feedforward: The information flows (or training of the network) in a neural network from the input layer to the output layer.
Backpropagation: The adjustment of the weights to minimize the loss function.

But that's a topic for another day when we take a deeper dive into the fundamental building block of artificial neural networks. Note: prerequisites for the next level include a basic understanding of statistical concepts and matrix multiplication.

Conclusion

Artificial Intelligence has, irrefutably, permeated several aspects of our life and has become the new normal. With the increasingly human-level accuracy of performing tasks pattern recognition, image classification, and more, the industry has revolutionized how we connect with machines every day.

A growing body of research and experimentation in the field of deep learning application is gradually normalizing AI into our day-to-day lives in the form of face and speech recognition and self-driving technology, to name a few. So how deep an impact can deep learning have in the digital transformation of businesses and how the world around us works? Human brains are working their neurons hard to push the limits of what artificial neural networks can achieve.

Watch this space for more on deep learning, and it's applications.

Jack Cullen served as the President of Modis, an IT and Engineering Staffing firm, overseeing the North American region which eclipsed $1.3B in annual revenue. He joined the company in 1997 via the acquisition of his company, Technical Software Solutions, Inc, and assumed the role of President in November 2000. Jack has been in the Information Technology and Engineering Recruitment business since 1985 and previously worked for Johnson & Johnson and Colgate Palmolive before entering the Staffing and Consulting Industry. He is a graduate of the University of Maryland. Jack served on the Board of Trustees at the University of Maryland and led the campaign that raised $1B for The University.

Mr. Cullen is a former President of the Washington Chapter of the National Association of Computer Consulting Businesses (NACCB) and has been a speaker and panelist at their National Conference. Mr. Cullen has served as a keynote speaker at numerous conferences and conventions throughout his career. He is an avid fan of all things basketball and a recreational golf enthusiast.

Sonia is the Senior Vice President at Analysts. In her role, Sonia leads our recruiting operations and is responsible for strategic sales and business development. As the leader of recruiting operations and service delivery, Sonia oversees our global delivery centers for service excellence and has an unwavering commitment to quality. She runs a delivery organization that is client-centric, process-driven, and comprises of a highly skilled and qualified talent pool to exceed client delivery expectations. Sonia is also instrumental in shaping and nurturing strategic business alliances by reinforcing client relationships and responding to client requirements with the best approach.

Sonia joined Analysts in 2012 and has since, held multiple strategic and leadership positions in the company. She started as an IT program manager where she led digital transformation across our back office and financial systems. She then went on to play a key role in restructuring our Operations department by building a global operations strategy and implementing the plan across several geographies.

With almost 10 years in the industry, Sonia is a key driver of Analysts growth with proven expertise in areas of talent solutions and digital transformation across a variety of industries.

Sonia holds a Bachelor of Science degree in Industrial and System Engineering from the Georgia Institute of Technology.

Saurabh is responsible for business operations at Analysts. This includes managing the delivery organization across the US and our offshore locations. He leads the effort to strengthen our relationships within our existing Managed Services Providers (MSP) accounts and grow our footprint with leading MSPs.

As a Senior Sales and Operations Professional with twenty years of experience in leading diverse sales and operations teams, Saurabh understands how to deliver value to a range of industries including IT Consulting and Services, Shared Services, Management Consulting and Contingent Workforce Management. He has extensive experience with Managed Service Providers including Tapfin, GRI, Allegis, Workforce Logiq, Pontoon, Agile1, PRO Unlimited and the Bartech Impellam as well as experience managing a broad range of Vendor Management Systems including Fieldglass, Beeline, PeopleClick, IQ Navigator, Econometrix, Provade, Wand and Acceleration.

Saurabh has managed numerous large and small MSP run contingent workforce management programs covering a wide range of industries; including Banking & Financial Services, Private Equity, Supply Chain & Logistics, Life Sciences, Entertainment, Automotive, Insurance, Government and Securities.

Saurabh has an MBA from the University of Notre Dame, South Bend, IN. He lives in Atlanta, GA with his wife and two children. While not traveling to provide our clients with exemplary service he likes to spend time with his family. He is a sports enthusiast and a huge fan of the Fighting Irish!

As Group Vice President, Jennifer D’Silva is part of the Executive team at Analysts responsible for setting the vision, goals and strategic direction of the organization. She oversees all Sales and Business Development initiatives and is responsible for developing high performing cross-functional multi-disciplinary teams.

As an accomplished revenue leader with over 22 years at DATA Inc., Jennifer led the company’s transformation from a small regional staffing company to a global provider of Staffing solutions to numerous Fortune 500 companies. She has successfully spearheaded several Sales and Business Development initiatives and has built long-term relationships with clients through business consulting and solutions delivery. Jennifer combines market knowledge and an excellent analytical mindset with team-leading skills to contribute towards organizational growth. She has gained the respect and admiration of her teams by providing them with mentorship and leadership development.

Jennifer holds a Degree in Marketing specializing in Business Economics and Sales Management. She lives in New Jersey with her family including her husband and two children.

Melissa was promoted to the role of Group Vice President at Analysts in June 2021. In her current role, she will be responsible for hiring, training, managing, and building Analysts’ sales and delivery organization to support current clients and the acquisition and development of new clients. As Group Vice President, she will also drive and manage the overarching strategy for how to maximize Analysts’ expansion within a business line.

As an accomplished professional, she has achieved may milestones in the IT and Life Sciences Recruiting industries. Her professional journey from a Recruiting Coordinator to a Group Vice President speaks volumes about her abilities as a manger to her team members and thought leader. She has over 20 years of industry experience and a strong work ethic. She believes in cultivating strong relationships with both consultants as well as clients. With her focus on organization building, she has achieved every objective on time, be it professional or organizational.

Melissa holds a Bachelor’s degree in Communications from University of Colorado Boulder.

Joseph Nordlinger is the President of Analysts, the Talent Services and Staffing Division for ACS Solutions. He provides organizational leadership to the sales and delivery functions of Analysts with a focus on aligning the service architecture of Analysts to ensure our clients benefit from our combined on-shore and offshore delivery teams. Leading Talent Services includes Talent Solutions, Workforce Management Solutions, and Technology Solutions offerings for our clients around the world.

With almost 20 years in the talent services industry, he brings operational leadership in the areas of technology, life sciences, payroll services/IC compliance, contingent workforce management, project and portfolio management, and data management. Joseph has been a leader in the Human Services Industry for nearly 22 years. Through the acquisition of the company he founded, Joseph recently joined ACS. He leads Analysts with a firm belief in the power of human networks to achieve incredible results. In this spirit, he continues to develop exceptionally talented and committed leaders in order to deliver unique and innovative human services and solutions for our clients.

Joseph founded and leads three of the fastest growing professional associations for Data Management, Portfolio Management, and Contingent Workforce leaders (Data Management Professionals, Project Portfolio Management Professionals, and Contingent Workforce Professionals) with nearly 22,000 combined members. Joseph is an alumnus of the American Graduate School of International Management. He also has a Bachelor of Arts in International Political Economics from the University of California, San Diego. Joseph is an active-duty Volunteer Firefighter in Napa CA, the President of the Mt Veeder Fire Safe Council, and Vice President of Napa Firewise.

As Group Vice President for Analysts, Ruby is responsible for leading and growing talent services, professional, and managed IT Services across National Strategic Clients. She is responsible for strategic client identification, development, overall sales, delivery and consultant care within strategic accounts.

As a professional with more than 25 years of experience in IT consulting, Ruby started her career with American Cybersystems Inc. in 2001. Beginning as a Sales Executive, she moved up the value chain by assuming various roles and responsibilities in sales, strategic initiatives, and recruiting. At ACS, she has led several of the high growth accounts within telecom, financial services, and life sciences verticals. Prior to ACS, she worked for India’s leading system integrators, HCL and NIIT, selling IT hardware and software solutions to clients. Additionally, she has an excellent track record of building and leading diverse national sales teams.

Ruby has a BS in Chemistry and a MBA from Goizueta Business School of Emory University. She is a member of the Executive women of Goizueta in Business Development and Strategy. She is an avid reader of business journals and participates in events related to technology and diversity.

Paul Cmiel is a Group Vice President at Analysts and leads a team of sales directors located throughout the United States. He works closely with delivery professionals to ensure project success for our clients. The sales team is responsible for collaborating with clients to create customized, innovative solutions that focus on solving business challenges and delivering the highest quality outcomes.

Prior to joining Analysts in 2012, Paul was the Senior Director of a Global Healthcare Sales and Marketing organization within the IT consulting industry. In this role, he was responsible for working with sales and marketing teams to promote and drive new business in the healthcare market on both a national and global level.

Paul has more than 21 years of experience developing and marketing business and technical solutions. He is highly skilled in delivering solutions tied to client strategic initiatives and resulting in strong ROI. Paul holds a Bachelor of Science degree in Business Administration and Marketing from the University of
Wisconsin-La Crosse.

Jeff Hoekstra is responsible for the development and delivery of the company’s differentiated business offerings, with a focus on Digital Transformation solutions.

Jeff is a business development executive with more than 20 years of experience in business entrepreneurship, IT consulting, and software products and services development. Before joining the Analysts team in 2012, Jeff held a variety of Senior Account Executive positions where he identified, developed, and managed strategic relationships and provided direction and support to CIOs and senior IT executives. Jeff was also the founder and managing partner of several consulting firms, where he focused on business strategy development, market analysis, go-to-market strategy, and rapid application software development resources for small businesses and start-up organizations.

Jeff is a highly motivated strategic leader and results-driven professional with a broad business history in both domestic and global business-to-business environments. Jeff holds a Bachelor of Arts degree from the University of St. Thomas, with post-graduate work at Ohio University.

Tim Atkinson is responsible for the development and management of Analysts’ business and technology consulting and staffing services. He oversees client relations and market development, service delivery, H/R, recruiting, and business operations.

Tim has more than 30 years of experience in management and technology consulting, selling to and consulting across industry and business functions. Prior to joining Analysts (ACS Solutions), Tim was a Regional Vice President who delivered technology-enabled business results to commercial and public sector clients across the Mid-Atlantic area.

Tim has worked with some of the largest consulting firms in the world. He has a solid track record of well-managed growth, implementing innovative client solutions and market offerings, and building and developing high-performing teams.

Tim holds a Bachelor of Science degree in Economics and an MBA, both from the University of South Carolina. He is a former board member of the Greater Washington Board of Trade and the Fairfax County Chamber of Commerce. He is a member of the Northern Virginia Technology Council, the North American Telecommunications Association, and the U.S. Chamber
of Commerce.

Fundamentals: Artificial Neural Networks are to deep learning, what atoms are to matter

Blog

John P. ”Jack” Cullen

Sonia Sardana

Saurabh Pathak

Jennifer D’Silva

Melissa Douglas

Joseph Nordlinger

Ruby Pandit

Paul Cmiel

Jeff Hoekstra

Tim Atkinson