Tag Archives: technology education

The Field of AI (Part 05): AI APPROACHES AND METHODS

AI & Neural Networks

A Context with Histories and Definitions

You do not have to say good-bye to those cheap tadalafil pills awkward canicule of abortive acclamation and lower self-esteem. Poor or weak erections are also the symptoms of your ED, talk to your doctor who may recommend you kamagra or click here cheap levitra. Heartburn and bloating are two common symptoms that one can expertise for diverticulitis is tenderness within the decrease left facet of the stomach which could be cheap levitra 20mg mild or abruptly flares up to severe pain. This is buying online viagra the same chemical released by the brain that is formed when one consumes drugs.
Figure 01 An example artificial neural network with a hidden layer. Credit: en:User:Cburnett / CC BY-SA (http://creativecommons.org/licenses/by-sa/3.0/) Retrieved on March 12, 2020 from here

A beautiful and clearly-explained introduction to Neural Networks is offered in a 20 minute video by Grant Sanderson in his “3Blue1Brown” series.[1] One is invited to view this and his other enlightening pieces.

The traditional Artificial Neural Network (ANN)[2] is, at a most basic level, a kind of computational model for parallel computing between interconnected units.  One unit could be given more or less numerical ‘openness’ (read: weight & bias)[3] then another unit, via the connections created between the units. This changing of the weight and the bias of a connection (which means, the allocation of a set of numbers, turning them up or down as if changing a set of dials), is the ‘learning’ of the network by means of a process, through a given algorithm. These changes (in weight and bias) will influence which signal will be propagated forwardly to which units in the network. This could be to all units (in a traditional sense) or to some units (in a more advanced development of a basic neural network, e.g. such as with Convoluted Neural Networks).[4] An algorithm processes signals through this network. At the input or inputs (e.g. the first layer) the data is split across these units. Each unit within the network can hold a signal (e.g. a number) and contains a computational rule, allowing activation (via a set threshold controlled by, for instance, a sigmoid function, or the recently more often applied “Rectified Linear Unit,” or ReLu for short), to send through a signal (e.g. a number) over that connection to a next unit or to a number of following units in a next layer (or output). The combination of all the units, connections and layers might allow the network to label, preferably correctly, the entirety of a focused-on object, at the location of the output or outputs layer. The result is that the object has been identified (again, hopefully, correctly or, at least, according to the needs).

The signal (e.g. a number) could be a representation of, for instance, 1 pixel in an image.[5] Note, an image is digitally composed of many pixels. One can imagine many of these so-called ‘neurons’ are needed to process only 1 object (consisting of many pixels)  in a snapshot of only the visual data (with possibly other objects and backgrounds and other sensory information sources) from an ever changing environment, surrounding an autonomous technology, operated with an Artificial Neural Network. Think of a near-future driverless car driving by in your street. Simultaneously, also imagine how many neurons and even more connections between neurons, a biological brain, as part of a human, might have. Bring to mind a human (brain) operating another car driving by in your street. The complexity of the neural interconnected working, the amount of data to be processed (and to be ignored) might strike one with awe.

The oldest form of such artificial network is a Single-layer Perceptron Network, historically followed by the Multilayer Perceptron Network. One could argue that ‘ANN’ is a collective name for any network that has been artificially made and that has been exhibiting some forms and functions of connection between (conceptual) units.

An ANN was initially aimed (and still is) at mimicking (or modeling, or abstracting) the brain’s neural network (i.e. the information processing architecture in biological learning systems).

Though, the term, Artificial Neural Network, contains the word ‘neural’, we should not get too stuck on the brain-like implications of this word which is derived from the word ‘neuron’. The word ‘neuron’ is not a precise term in the realm of AI and its networks. At times instead of ‘neuron’ the word ‘perceptron’ has been used, especially when referring to as specific type of (early) artificial neural network using thresholds (i.e. a function that allows for the decision to let a signal through or not; for instance, the previously-mentioned sigmoid function).

Figure 02 an electron microscope. Connections between brain cells, into a large neural network, were identified with an older version of this technology.

Image license and attribution: David J Morgan from Cambridge, UK / CC BY-SA (https://creativecommons.org/licenses/by-sa/2.0) Retrieved on April 23, 2020 from here

Nevertheless, maybe some brainy context and association might spark an interest in one or other learner. It might spark a vision for future research and development to contextualize these artificial networks by means of analogies with the slightly more tangible biological world. After all, these biological systems we know as brains, or as nervous systems, are amazing in their signal processing potentials. A hint of this link can also be explored in Neuromorphic Engineering and Computing.

The word “neuron” comes from Ancient Greek and means ‘nerve’. A ‘neuron,’ in anatomy or biology at large, is a nerve cell within the nervous system of a living organism (of the animal kingdom, but not sponges), such as mammals (e.g. humans). By means of small electrical (electro-chemical) pulses (i.e. nerve impulses), these cells communicate with other cells in the nervous system. Such a connection, between these types of cells, is called a synapse. Note, neurons cannot be found among fungi nor among plants (these do exchange signals, even between fungi and plants yet, in different chemical ways)… just maybe they are a steppingstone for one or other learner to imagine an innovative way to process data and compute outputs!

The idea here is that a neuron is “like a logic gate [i.e. ‘a processing element’] that receives input and then, depending on a calculation, decides either to fire or not.[6] Here, the verb “to fire” can be understood as creating an output at the location of the individual neuron. Also note, that a “threshold” is again implied here.

An Artificial Neural Network can then be defined as “…а computing ѕуѕtеm made up of a number of ѕimрlе, highlу intеrсоnnесtеd рrосеѕѕing elements, which рrосеѕѕ infоrmаtiоn by thеir dуnаmiс ѕtаtе response to еxtеrnаl inputs.”[7]

Remember, ‘neuron’, in an ANN, it should be underlined again, is a relatively simple mathematical function. It is, in general, agreed that this function is analogous to a node. Therefore, one can state that an Artificial Neural Network is built up of layers of interconnected nodes.[8] So, one can notice, in or surrounding the field of AI, that words such as unit, node, neuron or perceptron used interchangeably, while these are not identical in their deeper meaning. More recently the word “capsule” has been introduced, presenting an upgraded version of the traditional ‘node,’ the latter equaling one ‘neuron.’ Rather, a capsule is a node in a network equaling a collection of neurons.[9] A little bit of additional information on this can be found here below.

How could an analogy with the brain be historically contextualized? In the early 1950s, with the use of electron microscopy, it was proven that the brain exists of cells, which preceding were labelled as “neurons”.[10] It unequivocally showed the interconnectedness (via the neuron’s extensions, called axons and dendrites) between these neurons, into a network of a large number of these cells. A single of these type of locations of connection between neurons has been labeled as a “synapse”.

Since then it has been established that, for instance, the human cerebral cortex contains about 160 trillion synapses (that’s a ‘160’ and another 12 zeros: 160000000000000) between about a 100 billion neurons (100000000000). Synapses are the locations between neurons where the communication between the cells is said to occur.[11] In comparison some flies have about 100000 neurons and some worms a few hundreds.[12] The brain is a “complex, nonlinear, and parallel computer (information-processing system)”.[13] The complexity of the network comes with the degree of interconnectedness (remember, in a brain that’s synapses).

Whereas it is hard for (most) humans to multiply numbers at astronomically fast speeds, it is easy for a present-day computer. While it is doable for (most) humans to identify what a car is and what it might be doing next, this is (far) less evident for a computer to (yet) handle. This is where, as one of many examples, the study and developments of neural networks (and now also Deep Learning) within the field of AI has come in handy, with increasingly impressive results. The work is far from finished and much can still be done.

The field of study of Artificial Neural Networks is widely believed to have started a bit earlier than the proof of connectivity of the brain and its neurons. It is said to have begun with the 1943 publication by Dr. McCulloh and Dr. Pitts, and their Threshold Logic Unit (TLU). It was then followed by Rosenblatt’s iterations of their model (i.e. the classical perceptron organized in a single layered network) which in turn was iterated upon by Minsky and Papert. Generalized, these were academic proposals for what one could understand as an artificial ‘neuron’, or rather, a mathematical function that aimed to mimic a biological neuron, and the network made therewith, as somewhat analogously found within the brain.[14]

Note, the word ‘threshold’ is of use to consider a bit further. It implies some of the working of both the brain’s neurons and of ANNs’ units. A threshold in these contexts, implies the activation of an output if the signal crosses the mathematically-defined “line” (aka threshold). Mathematically, this activation function can be plotted by, for instance, what is known as a sigmoid function (presently less used). The sigmoid function was particularly used in the units (read in this case: ‘nodes’ or ‘neurons’ or ‘perceptrons’) of the first Deep Learning Artificial Neural Networks. Presently, the sigmoid function is at times being substituted with improved methods such as what is known as “ReLu” which is short for ‘Rectified Linear Unit’. The latter is said to allow for better results and is said to be easier to manage in very deep networks.[15]

Turning back to the historical narrative, it was but 15 years later than the time of the proposal of the 1943 Threshold Logic Unit, in 1958, with Rosenblatt’s invention and hardware design of the Mark I Perceptron —a machine aimed at pattern recognition in images (i.e. image recognition) — that a more or less practical application of such network had been built.[16] As suggested, this is considered being a single-layered neural network.

This was followed by a conceptual design from Minsky and Papert, considering the multilayered perceptron (MLP), using a supervised learning technique. The name gives it away, this is the introduction of the multi-layered neural network. While hinting at nonlinear functionality,[17] yet this design was still without the ability to perform some basic non-linear logical functions. Nevertheless, the MLP was forming the basis for the neural network designs as they are developed presently. Presently, Deep Learning research and development has advanced beyond these models.

Simon Haykin puts it with a slight variation in defining a neural network when he writes that it is a “massively parallel distributed processor, made up of simple processing units, that has a natural propensity for storing experiential knowledge and making it available for use. It resembles the brain in two respects: 1. Knowledge is acquired by the network from its environment through a learning process. 2. Inter-neuron connection strengths, known as synaptic weights, are used to store the acquired knowledge.”[18]

Let us shortly touch on the process of learning in the context of an ANN and that with a simplified analogy. One way to begin understanding the learning process, or training, of these neural networks, in a most basic sense, can be done by looking at how a network would (ignorantly) guess the conversion constant between kilometers and miles without using algebra. One author, Tariq Rashid, offers the following beautifully simple example in far more detail. The author details an example where one can imagine the network honing in on the conversion constant between, for instance, kilometers and miles.

Summarized here: The neural network could be referencing examples. Let us, as a simple example, assume it ‘knows’ that 0 km equals 0 miles. It also ‘knows’, from another given example, that 100 km is 62.137 miles. It could ‘guess’ a number for the constant, given that it is known that 100 (km) x constant = some miles. The network randomly could, very fast, offer a pseudo-constant guessed as 0.5. Obviously, that would create an error compared to the given example. In a second guess it could offer 0.7. This would create a different kind of error. The first is too small and the second is too large. The network consecutively undershot and then overshot the needed value for the constant.

By repeating a similar process, whereby a next set of numbers (= adjusted parameters internal to the network) is between 0.5 and 0.7 with one closer to the 0.5 and the others closer to 0.7, the network gets closer in estimating the accurate value for its needed output (e.g. 0.55 and 0.65; then next 0.57 and 0.63, and so on). The adjusting of the parameters would be decided by how right or wrong the output of the network model is compared to the known example that is also known to be true (e.g. a given data set for training). Mr. Rashid’s publication continues the gentle introduction into supervised training and eventually building an artificial neural network.

In training the neural network to become better at giving the desired output, the network’s weights and biases (i.e. its parameters) are tweaked. If the output has a too large an error, the tweaking processes is repeated until the error in the output is acceptable and the network has turned out to be a workable model to make a prediction or give another type of output.

In the above example one moves forward and backward until the smallest reasonable error is obtained. This is, again somewhat over-simplified how a backpropagation algorithm functions in the training process of a network towards making it a workable model. Note, “propagate” means to grow, extend, spread, reproduce (which, inherently, are forward movements over time).

These types of network, ANNs or other, are becoming both increasingly powerful and diversified. They also are becoming increasingly accurate in identifying and recognizing patterns in certain data sets of visual (i.e. photos, videos), audio (i.e. spoken word, musical instruments, etc.) or other nature. These are becoming more and more able to identify patterns, as well as humans are able to and beyond what humans are able to handle.[19]

Dr. HINTON, Geoffrey[20]  is widely considered as one of the leading academics in Artificial Neural Networks (ANNs) and specifically seen as a leading pioneer in Deep Learning.[21] Deep Learning, a type of Machine Learning, is highly dependent on various types of Artificial Neural Network.[22] Dr. Hinton’s student, Alex Krizhevsky, noticeably helped to boost the field of computer vision by winning the 2012 ImageNet Competition and this by being the first to use a neural network.

To round the specific ‘ANN’ introduction up, let us imagine, perhaps in the processes of AI research and specifically in its area similar to those of ANNs, solutions can be thought up or are already being thought of that are less (or more) brain-like or for which the researchers might feel less (or more) of a need to make an analogy with a biological brain. Considering processes of innovation, one might want to keep an open-mind to these seemingly different meanderings of thought and creation.

Going beyond the thinking of ANNs, one might want to fine-tune an understanding and also consider diversity in forms and functions of these or other such networks. There are, for instance, types going around with names such as ‘Deep Neural Networks’ (DNNs) which, are usually extremely large and are usually applied to process very large sets of data.[23] One can also find terminologies such as the ‘Feedforward Neural Networks’ (FFNNs), which is said to be slightly more complex than the traditional and old-school perceptron networks;[24] ‘Convolutional Neural Networks’ (CNNs), which are common in image recognition; ‘Recurrent Neural Networks’ (RNNs) and its sub-type of ‘Long Short-term Memory’ networks (LSTM), which apply feedback connection and which are used in Natural Language Processing. These latter networks are claimed to still apply sigmoid functions, contrary to the increased popularity of other functions.[25] All of these and more are studied and developed in the fields of Machine Learning and Deep Learning. All these networks would take us rather deep into the technicalities of the field. You are invited to dig deeper and explore some of the offered resources.

It might be worthwhile to share that CNN solutions are particularly well-established in computer vision. The neurons specialized in the visual cortex of the brain and how these do or do not react to the stimuli coming into their brain region from the eyes, were used as an inspiration in the development of the CNN. This design helped to reduce some of the problems that were experienced with the traditional artificial neural networks. CNNs do have some shortcomings, as many of these cutting-edge inventions stil need to be further researched and fine-tuned.[26]

Capsule Networks (CapsNets)

In the process of improvement, innovation and fine-tuning, there are new networks continuously being invented. For instance, in answering some of the weaknesses of ‘Convolutional Neural Networks’ (CNNs), the “Capsule Networks (CapsNets)” are a relative recent invented answer, from a few years ago, by Hinton and his team.[27] It is also believed that these CapsNets mimic better how a human brain processes vision then what the CNNs have been enabled to offer up till now.

To put it too simple, it’s an improvement onto the previous versions of  nodes in  a network (a.k.a. ‘neurons’) and a neural network. It tries to “perform inverse graphics”, where inverse graphics is a process of extracting parameters from a visual that can identify location of an object within that visual. A capsule is a function that aids in the prediction of the “presence and …parameters of a particular object at a given location.[28] The network hints at outperforming the traditional CNN in a number of ways such as the increased ability to identify additional yet functional parameters associated with an object. One can think of orientation of an object but also of its thickness, size, rotation and skew, spatial relationship, to name but a few.[29] Although a CNN can be of use to identify an object, it cannot offer an identification of that object’s location. Say a mother with a baby can be identified. The CNN cannot support the identification whether they are on the left of one’s visual field versus the same humans but on the right side of the image.[30] One might imagine the eventual use of this type of architectures in, for instance, autonomous vehicles.

Generative Adversarial Networks (GANs)

This type of machine learning method, a Generative Adversarial Network (GAN), was invented in 2014 by Dr. Goodfellow and Dr. Bengio, among others.[31]

Figure 03 This young lady does not (exactly) physically exist. The image was created by a GAN; a StyleGAN based on the analysis of photos of existing individuals. Source: public domain (since it is created by an AI method, and a method is not a person, it is not owned). Retrieved March 10, 2020 from here

It’s an unsupervised learning technique that allows to go beyond historical data (note, it is debatable that, most if not all data is inherently historical from the moment following its creation). In a most basic sense, it is a type of interaction, by means of algorithms (i.e. Generative Algorithms), between two Artificial Neural Networks.

The GANs allow to create new data (or what some refer to as “lookalike data”)[32] by applying features, by means of certain identified features, from the historical referenced data. For instance, a data set, existing of what we humans perceive as images, and then of a specific style, can allow this GANs’ process to generate a new (set of) image(s) in the style of the studied set. Images are but one media. It can handle digital music, digitized artworks, voices, faces, video… you name it. It can also cross-pollinate between media types, resulting in a hybrid between a set of digitized artworks and a landscape, resulting in a landscape “photo” in a style of the artwork data set. The re-combinations and reshufflings are quasi unlimited. Some more examples are of GANs types are those that can

  • …allow for black and white imagery to be turned into colorful ones in various visual methods and styles.[33]
  • …turn descriptive text of, say different birds into photo-realistic bird images.[34]
  • …create new images of food based on their recipes and reference images.[35]
  • …turn a digitized oil painting into a photo-realistic version of itself; turning a winter landscape into a summer landscape, and so on.[36]

If executed properly, for instance, the resulting image could make an observer (i.e. a discriminator) decide that the new image (or data set) is as (authentic as) the referenced image(s) or data set(s) (note: arguably, in the digital or analog world, an image or any other media of content is a data set).

It is also a technique whereby two neural networks contest with each other. They do so in a game-like setting as it is known in the mathematical study of models of strategic decision-making, entitled “Game Theory.” Game Theory is not to be confused with the academic field of Ludology, the latter which is the social, anthropological and cultural study of play and game design. While often one network’s gain is the other network’s loss (i.e. a zero-sum game), this is not always necessarily the case with GANs.

It is said that GANs can also function and offer workable output with relatively small data sets (which ic an advantage compared to some other techniques).[37]

It has huge applications in the arts, advertising, film, animation, fashion design, video gaming, etc. These professional fields are each individually known as multi-billion dollar industries. Besides entertainment it is also of use in the sciences such as physics, astronomy and so on.

Applications

One can learn how to understand and build ANNs online via a number of resources. Here below are a few hand-picked projects that might offer a beginner’s taste to the technology.

Project #___: Making Machine Learning Neural Networks (for K12 students by
Oxford University)
Project source: 
https://ecraft2learn.github.io/ai/AI-Teacher-Guide/chapter-6.html
Project #___: Rashid, T. (2016). Make Your Own Neural Network
A project-driven book examining the very basics of neural networks and aiding a learning step by step into creating a network. Published as eBook or paper via CreateSpace Independent Publishing Platform.
This might be easily digested by Middle Schools students or learners who cannot spend too much effort yet do want to learn about neural networks in an AI context.
information retrieved on April 2, 2020 from http://makeyourownneuralnetwork.blogspot.com/
Project #___:  A
small example: Training a model to estimate square roots (click on the image to
enter the SNAP! environment)
Project source: 
https://ecraft2learn.github.io/ai/AI-Teacher-Guide/chapter-6.html
Project #___:  Training
a model to label data (click on the image to enter the SNAP! environment)
Project source: 
https://ecraft2learn.github.io/ai/AI-Teacher-Guide/chapter-6.html
Project #___:  Training
a model to predict how you would rate abstract "art"
Project source: 
https://ecraft2learn.github.io/ai/AI-Teacher-Guide/chapter-6.html
Project #___: A Neural Network to recognize hand-written digits
This
project comes with an online book and code by Michael Nielsen. 
Source code: 
https://github.com/mnielsen/neural-networks-and-deep-learning  
Updated source code: https://github.com/MichalDanielDobrzanski/DeepLearningPython35 
Database:
http://yann.lecun.com/exdb/mnist/ (a training set of 60,000 examples, and a test set of 10,000 examples)
Study material: 
http://neuralnetworksanddeeplearning.com/chap1.html   
Project #___: MuZero: Build a Neural Network using Python[1] 
project source: https://medium.com/applied-data-science/how-to-build-your-own-muzero-in-python-f77d5718061a
[1] Schrittwieser, J. et al. (2020). Mastering Atari, Go, Chess and Shogi by
Planning with a Learned Model. Online: arXiv.org, Cornell University;
Retrieved on April 1, 2020 from  https://arxiv.org/abs/1911.08265 

References & URLs

[1]Sanderson, G.  (? Post-2016).  3BLUE1BROWN SERIES. But what is a Neural Network? | Deep Learning, chapter 1.  S3 • E1 (Video). Online. Retrieved on April 22, 2020 from https://www.bilibili.com/video/BV12t41157gx?from=search&seid=15254673027813667063 AND the entire series: https://search.bilibili.com/all?keyword=3Blue1Brown&from_source=nav_suggest_new AND https://www.youtube.com/watch?v=aircAruvnKk Information Retrieved from https://www.3blue1brown.com/about

[2] Nielsen, M. (2019). Neural Networks and Deep Learning. Online: Determination Press. Retrieved on April 24, 2020 from http://neuralnetworksanddeeplearning.com/  AND https://github.com/mnielsen/neural-networks-and-deep-learning AND http://michaelnielsen.org/

[3] Marsland, S. (2015). Machine Learning. An Algorithmic Perspective. Boca Raton, FL, USA: CRC Press. p.14

[4] Charniak, E. (2018). Introduction to Deep Learning. Cambridge, MA: The MIT Press p.51

[5] Sanderson, G.  (? Post-2016). 

[6] Du Sautoy, M. (2019). The Creative Code. How AI is Learning to Write, Paint and Think. London: HarperCollins Publishers. pp.117

[7]Dr. Hecht-Nielsen, Robert in Caudill, M. (December, 1987).  “Neural Network Primer: Part I”. in AI Expert Vol. 2, No. 12, pp 46–52. USA:      Miller Freeman, Inc. Information Retrieved on April 20, 2020 from https://dl.acm.org/toc/aiex/1987/2/12 and  https://dl.acm.org/doi/10.5555/38292.38295 ; citation retrieved from https://www.oreilly.com/library/view/hands-on-automated-machine/9781788629898/0444a745-5a23-4514-bae3-390ace2dcc61.xhtml

[8] Rashid, T.  (2016). Make Your Own Neural Network.  CreateSpace Independent Publishing Platform

[9] Sabour, S. et al. (2017). Dynamic Routing Between Capsules. Online: arXiv.org, Cornell University; Retrieved on April 22, 2020 from https://arxiv.org/pdf/1710.09829.pdf

[10] Sabbatini, R. (Feb 2003). Neurons and Synapses. The History of Its Discovery. IV. The Discovery of the Synapse. Online: cerebromente.org. Retrieved on April 23, 2020 from http://cerebromente.org.br/n17/history/neurons4_i.htm

[11] Tang Y. et al (2001). Total regional and global number of synapses in the human brain neocortex. In Synapse 2001;41:258–273.

[12] Zheng, Z., et al. (2018). A Complete Electron Microscopy Volume of the Brain of Adult Drosophila melanogaster. In Cell, 174(3), 730–743.e22

[13] Haykin, S. (2008). Neural Networks and Learning Machines. New York: Pearson Prentice Hall. p.1

[14] McCulloch, W.. & Pitts, W. (1943; reprint: 1990). A Logical Calculus of the Ideas Immanent in Nervous Activity. Bulletin of Mathematical Biophysics, Vol. 5, pp.115-133. Retrieved online on February 20, 2020 from  https://www.cs.cmu.edu/~./epxing/Class/10715/reading/McCulloch.and.Pitts.pdf  

[15] Sanderson, G.  (? Post-2016). 

[16] Rosenblatt, F. (January, 1957). The Perceptron. A Perceiving and Recognizing Automaton. Report No. 85-460-1. Buffalo (NY): Cornell Aeronautical Laboratory, Inc. Retrieved on January 17, 2020 from https://blogs.umass.edu/brain-wars/files/2016/03/rosenblatt-1957.pdf 

[17] Samek, W. et al (2019). Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Lecture Notes in Artificial Intelligence. Switzerland: Springer. p.9

[18] Haykin, S. (2008). p.2

[19] Gerrish, S. (2018). How Smart Machines Think. Cambridge, MA: The MIT Press. pp. 18

[20] Gerrish, S. (2018). pp. 73

[21] Rumelhart, David E.; Hinton, Geoffrey E.; Williams, Ronald J. (9 October 1986). “Learning representations by back-propagating errors”. Nature. 323 (6088): 533–536

[22] Montavon, G. et al. (2012). Neural Networks: Tricks of the Trade. New York: Springer. Retrieved on March 27, 2020 from https://link.springer.com/book/10.1007/978-3-642-35289-8 AND https://machinelearningmastery.com/neural-networks-tricks-of-the-trade-review/   

[23] de Marchi, L. et al. (2019). Hands-on Neural Networks. Learn How to Build and Train Your First Neural Network Model Using Python. Birmingham & Mumbai: Packt Publishing. p. 9.

[24] Charniak, E. (2018). Introduction to Deep Learning. Cambridge, MA: The MIT Press. p. 10

[25] de Marchi, L. et al. (2019). p. 118-119.

[26] Géron, A. (February, 2018). Introducing capsule networks. How CapsNets can overcome some shortcomings of CNNs, including requiring less training data, preserving image details, and handling ambiguity. Online: O’Reilly Media. Retrieved on April 22, 2020 from https://www.oreilly.com/content/introducing-capsule-networks/

[27] Sabour, S. et al. (2017)

[28] Géron, A. (2017). Capsule Networks (CapsNets) – Tutorial (video). Retrieved on April 22, 2020 from https://www.bilibili.com/video/av17961595/ AND  https://www.youtube.com/watch?v=pPN8d0E3900

[29] Géron, A. (February, 2018).

[30] Tan, K. (November, 2017).  Capsule Networks Explained. Online. Retrieved on April 22, 2020 from https://kndrck.co/posts/capsule_networks_explained/ AND https://gist.github.com/kendricktan/9a776ec6322abaaf03cc9befd35508d4

[31] Goodfellow, I. et al. (June 2014). Generative Adversarial Nets. Online: Neural Information Processing Systems Foundation, Inc  Retrieved on March 11, 2020 from https://papers.nips.cc/paper/5423-generative-adversarial-nets.pdf AND Online: arXiv.org, Cornell University;  https://arxiv.org/abs/1406.2661

[32] Skanski, S. (2020). Guide to Deep Learning. Basic Logical, Historical and Philosophical Perspectives. Switzerland: Springer Nature. p. 127

[33] Isola, P. et al. (2016, 2018). Image-to-Image Translation with Conditional Adversarial Networks. Online: arXiv.org, Cornell University; Retrieved on April 16, 2020 from https://arxiv.org/abs/1611.07004

[34] Zhang, H. et al. (2017). StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. Online: arXiv.org, Cornell University; Retrieved on April 16, 2020 from https://arxiv.org/pdf/1612.03242.pdf

[35] Bar El, O. et al. (2019). GILT: Generating Images from Long Text. Online: arXiv.org, Cornell University; Retrieved on April 16, 2020 from https://arxiv.org/abs/1901.02404

[36] Zhu, J. (2017). Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. Online: arXiv.org, Cornell University; Retrieved on April 16, 2020 from https://arxiv.org/pdf/1703.10593.pdf

[37] Skanski, S. (2020). p.127

The Field of AI (part 02): A “Pre-History” & a Foundational Context.

last update: Friday, April 24, 2020

URLs for A “Pre-History” & a Foundational Context:

  • This post is the main post on a Pre-History & a Foundational context of the Field of AI. In this post a narrative is constructed surrounding the “Pre-History”. It links with the following posts:
  • This post is a first and very short linking with on Literature, Mythology & Arts as one of the foundational contexts of the Field of AI
  • The second part in the contextualization is the post touching on a few attributes from Philosophy, Psychology and Linguistics
  • Following one can read about very few attributes picked up on from Control Theory as contextualizing to the Field of AI
  • Cognitive Science is the fourth field that is mapped with the Field of AI.
  • Mathematics & Statistics is in this writing the sixth area associated as a context to the Field of AI
  • Other fields contextualizing the Field of AI are being considered (e.g. Data Science & Statistics, Economy, Engineering fields)
According to a blog associated with the Institute for Marital Healing, parents can at times be the culprit: not by treating the deeprootsmag.org prescription de viagra canada child badly, but the opposite, being too permissive and indulging a child’s selfishness. With the help of male enhancement pills, you will cherish your decision throughout your cheapest viagra for sale life. The natural means of curing erectile dysfunction cialis online australia include minimizing fat in your diet, cutting down alcohol intake, reducing cholesterol levels in the body, and exercising regularly. Go For Only Genuine Products A safe and original medication helps to lead a successful ED treatment. viagra price uk

The Field of AI: A “Pre-History”.

A “pre-history” and a foundational context of Artificial Intelligence can arguably by traced back to a number of events in the past as well as to a number of academic fields of study. In this post only a few have been handpicked.

This post will offer a very short “pre-history” while following posts will dig into individual academic fields that are believed to offer the historical and present-day context for the field of AI.

It is not too far-fetched to link the roots of AI, as the present-day field of study, with the human imagination of artificial creatures referred to as “automatons” (or what could be understood as predecessors to more complex robots).

While it will become clear here that the imaginary idea of automatons in China is remarkably older, it has been often claimed that the historic development towards the field of AI, as it is intellectually nurtured today, commenced more than 2000 years ago in Greece, with Aristotle and his formulation of the human thought activity known as “Logic”.

Presently, with logic, math and data one could make a machine appear to have some degree of “intelligence”. Note, it is rational to realize that the perception of an appearance does not mean the machine is intelligent. What’s more, it could be refreshing to consider that not all intelligent activity is (intended to be seen as) logical.

It’s fun, yet important, to add that to some extent, initial studies into logic could asynchronously be found in China’s history with the work by Mòzǐ (墨子), who conducted his philosophical reflections a bit more than 2400 years ago. 

Coming back to the Ancient Greeks: besides their study of this mode of thinking, they also experimented with the creation of basic automatons.

Automatons (i.e. self-operating yet artificial mechanical creatures) were likewise envisioned in China and some basic forms were created in its long history of science and technology.[1] An early mentioning can be found in, Volume 5 “The Questions of Tang” (汤问; 卷第五 湯問篇) of the Lièzǐ (列子)[2], an important historical Daoist text.

In this work there is mentioning of this kind of (imagined) technologies or “scientific illusions”.[3] The king in this story became upset by the appearance of intelligence and needed to be reassured that the automaton was only that, a machine …

Figure 1 King of Zhōu, who reigned a little more than 2950 years ago ( 周穆王; Zhōu Mù Wáng ) , introduced by Yen Shi, is meeting an automaton (i.e. the figure depicted with straighter lines, on the top-left), as mentioned in the fictional book Lièzǐ. Image retrieved on March 5, 2020 from here
Figure 2 Liè Yǔkòu (列圄寇/列禦寇), aka the Daoist philosopher Lièzĭ (列子) who imagined an (artificial) humanoid automaton. This visual was painted with “ink and light colors on gold-flecked paper,” by Zhāng Lù (张路); during the Míng Dynasty (Míng cháo, 明朝; 1368–1644). Retrieved on January 12, 2020 from here ; image license: public domain.

Jumping forward to the year 1206, the Arabian inventor, Al-Jazari, supposedly designed the first programmable humanoid robot in the form of a boat, powered by water flow, and carrying four mechanical musicians. He wrote about it in his work entitled “The Book of Knowledge of Ingenious Mechanical Devices.

It is believed that Leonardo Da Vinci was strongly influenced by his work.[4] Al-Jazari additionally designed clocks with water or candles. Some of these clocks could be considered programmable in a most basic sense.

figure 3 Al-jazari’s mechanical musicians machine (1206). Photo Retrieved on March 4, 2020 from here; image: public domain

One could argue that the further advances of the clock (around the 15th and 16th century) with its gear mechanisms, that were used in the creation of automatons as well, were detrimental to the earliest foundations, moving us in the direction of where we are exploring AI and (robotic) automation or autonomous vehicles today.

Between the 16th and the 18th centuries, automatons became more and more common.  René Descartes, in 1637, considered thinking machines in his book entitled “Discourse on the Method of Reasoning“. In 1642, Pascal created the first mechanical digital calculating machine.

Figure 4 Rene Descartes; oil on canvas; painted by Frans Hals the Elder (1582 – 1666; A painter from Flanders, now northern Belgium, working in Haarlem, the Netherlands. This work: circa 1649-1700; photographed by André Hatala . File retrieved on January 14, 2020 from here. Image license: public Domain

Between 1801 and 1805 the first programmable machine was invented by Joseph-Marie Jacquard. He was strongly influenced by Jacques de Vaucanson with his work on automated looms and automata. Joseph-Marie’s loom was not even close to a computer as we know it today. It was a programmable loom with punched paper cards that automated the action of the textile making by the loom. What is important here was the system with cards (the punched card mechanism) that influenced the technique used to develop the first programmable computers.

Figure 5 Close-up view of the punch cards used by Jacquard loom on display at the Museum of Science and Industry in Manchester, England. This public domain photo was retrieved n March 12, 2020 from here; image: public domain

In the first half of the 1800s, the Belgian mathematician, Pierre François Verhulst discovered the logistic function (e.g. the sigmoid function),[1] which will turn out to be quintessential in the early-day developments of Artificial Neural Networks and specifically those called “perceptrons” with a threshold function, that is hence used to activate the output of a signal, and which operate in a more analog rather than digital manner, mimicking the biological brain’s neurons. It should be noted that present-day developments in this area do not only prefer the sigmoid function and might even prefer other activation functions instead.


[1] Bacaër, N. (2011). Verhulst and the logistic equation (1838). A Short History of Mathematical Population Dynamics. London: Springer. pp. 35–39.  Information retrieved from https://link.springer.com/chapter/10.1007%2F978-0-85729-115-8_6#citeas and from mathshistory.st-andrews.ac.uk/Biographies/Verhulst.html  

In 1936 Alan Turing proposed his Turing Machine. The Universal Turing Machine is accepted as the origin of the idea of a stored-program computer. This would later, in 1946, be used by John von Neumann for his “Electronic Computing Instrument“.[6] Around that same time the first general purpose computers started to be invented and designed. With these last events we could somewhat artificially and arbitrarily claim the departure from “pre-history” into the start of the (recent) history of AI.

figure 6 Alan Turing at the age of 16. Image Credit: PhotoColor [CC BY-SA (https://creativecommons.org/licenses/by-sa/4.0)] ; Image source Retrieved April 10, 2020 from here


As for fields of study that have laid some “pre-historical” foundations for AI research and development, which continue to be enriched by AI or that enrich the field of AI, there are arguably a number of them. A few will be explored in following posts. The first posts will touch on a few hints of Literature, Mythology and the Arts.


[1] Needham, J. (1991). Science and Civilisation in China: Volume 2, History of Scientific Thought. Cambridge, UK: Cambridge University.

[2] Liè Yǔkòu (列圄寇 / 列禦寇). (5th Century BCE). 列子 (Lièzǐ). Retrieved on March 5, 2020 from https://www.gutenberg.org/cache/epub/7341/pg7341-images.html  and 卷第五 湯問篇 from https://chinesenotes.com/liezi/liezi005.html   and an English translation (not the latest) from  https://archive.org/details/taoistteachings00liehuoft/page/n6/mode/2up  

[3] Zhāng, Z. (张 朝 阳).  ( November 2005). “Allegories in ‘The Book of Master Liè’ and the Ancient Robots”. Online: Journal of Heilongjiang College of Education. Vol.24 #6. Retrieved March 5, 2020 from https://wenku.baidu.com/view/b178f219f18583d049645952.html

[4] McKenna, A. (September 26, 2013). Al-Jazarī Arab inventor. In The Editors of Encyclopaedia Britannica. Online: Encyclopaedia Britannica Retrieved on March 25, 2020 from https://www.britannica.com/biography/al-Jazari AND:

Al-Jazarī, Ismail al-Razzāz; Translated & annotated by Donald R. Hill. (1206). The Book of Knowledge of Ingenious Mechanical Devices. Dordrecht, The Netherlands: D. Reidel Publishing Company. Online Retrieved on March 25, 2020 from https://archive.org/details/TheBookOfKnowledgeOfIngeniousMechanicalDevices/mode/2up

[5] Bacaër, N. (2011). Verhulst and the logistic equation (1838). A Short History of Mathematical Population Dynamics. London: Springer. pp. 35–39.  Information retrieved from https://link.springer.com/chapter/10.1007%2F978-0-85729-115-8_6#citeas and from mathshistory.st-andrews.ac.uk/Biographies/Verhulst.html

[6] Davis, M. (2018). The Universal Computer: the road from Leibniz to Turing. Boca Raton, FL: CRC Press, Taylor & Francis Group

The Field of AI (part 01): Context, Learning & Evolution

One could state that Artificial Intelligence (AI) methods enable the finding of and interaction with patterns in the information available from contexts to an event, object or fact. These can be shaped into data points and sets. Many of these sets are tremendously large data sets. So large are these pools of data, so interconnected and so changing that it is not possible for any human to see the patterns that are actually there or that are meaningful, or that can actually be projected to anticipate the actuality of an imagined upcoming event.

While not promising that technologies coming out from the field of AI are the only answer, nor the answer to everything, one could know their existence and perhaps apply some of the methods used in creating them. One could, furthermore, use aspects from within the field of AI to learn about a number of topics, even about the processes of learning itself, about how to find unbiased or biased patterns in the information presented to us. Studying some basics about this field could offer yet another angle of meaning-giving in the world around and within us.  What is a pattern, if not an artificial promise to offer some form of meaning?

It’s not too far-fetched to state that the study of Artificial Intelligence is partly the study of cognitive systems[1] as well as the context within which these (could) operate. While considering AI[2], one might want to shortly consider “context.”

Here “context” is the set of conditions and circumstances preceding, surrounding or following a cognitive system and that related to its processed, experienced, imagined or anticipated events. One might want to weigh how crucial conditions and circumstances are or could be to both machine and human.[3]  The field of AI is one of the fields of study that could perhaps offer one such opportunity.

A context is a source for a cognitive system to collect its (hopefully relevant) information, or at least, its data from. Cognitive Computing (CC) systems are said to be those systems that try to simulate the human thought processes, to solve problems, via computerized models.[4] It is understandable that some classify this as a subset of Computer Science while some will obviously classify CC as a (sometimes business-oriented) subset of the field of AI.[5] Others might link this closer to the academic work done in Cognitive Science. Whether biological or artificial, to a number of researchers the brain-like potentials are their core concern.[6]

As can be seen in a few of the definitions and as argued by some experts, the broad field of AI technologies do not necessarily have to mimic *human* thought processes or human intelligence alone. As such, AI methods might solve a problem in a different way from how a human might do it.

However similar or different, the meaning-giving information, gotten from a context, is important to both an AI solution as well as to a biological brain. One might wonder that it is their main reason for being: finding and offering meaning.

The contextual information an AI system collects could be (defined by or categorized as) time, locations, user profiles, rules, regulations, tasks, aims, sensory input, various other big to extremely huge data sets and the relationships between each of these data sets in terms of influencing or conflicting with one another. All of these sources for data are simultaneously creating increasing complexities, due to real-time changes (i.e. due to ambiguity, uncertainty, and shifts). AI technologies offer insights through their outputs of the *best* solution, rather than the one and only certain solution for a situation, in a context at a moment in spacetime.

The wish to understand and control “intelligence” has attracted humans for a long time. It is then reasonable to think that it will attract our species’ creative and innovative minds for a long time to come. It is in our nature to wonder, in general, and to wonder about intelligence and wisdom in specific; whatever their possible interlocked or independent definitions might be(come) and whichever their technological answers might be.

In considering this, one might want to be reminded that the scientific name of our species itself is a bit of a give-away of this (idealized) intention or aspiration: “Homo Sapiens.” This is the scientific name of our animal species. Somewhat loosely translated, it could be understood to mean: “Person of Wisdom”. 

In the midst of some experts who think that presently our intelligence is larger than our wisdom, others feel that, if handled with care, consideration and contextualization, AI research and developments just might positively answer such claim or promise and might at least augment our human desires towards becoming wiser.[7] Just perhaps, some claim,[8] it might take us above and beyond[9] being Homo Sapiens.[10]

For now, we are humans exploring learning with and by machines in support of our daily yet global needs.

For you and I, the steps to such aim need to be practical. The resources to take the steps need to be graspable here and now.

At the foundation, to evaluate the validity or use of such claims, we need to understand a bit what we are dealing with. Besides the need for the nurturing of a number of dimensions in our human development, we might want to nurture our Technological Literacy (or “Technology Literacy”).[11]

A number of educators[12] seem to agree that,[13] while considering human experiences and their environments, this area of literacy is not too bad a place to start off with.[14] In doing so, we could specifically unveil a few points of insight associated with Artificial Intelligence; that human-made technological exploration of ambiguous intelligence.

Few people know that heart comes http://appalachianmagazine.com/2016/12/25/why-stink-bugs-are-taking-over-the-eastern-united-states/ online prescription for viagra under pressure, when stress control of the mind. The uses of alcohol, smoking, tobacco, and http://appalachianmagazine.com/2019/03/10/mountain-tradition-eating-ramps-in-springtime/ super generic viagra illegal drugs have great influence of carrying this sexual complication. You on line viagra can also get discount on bulk order for this product. While viagra lowest prices some states don’t need that this online drug is not good as their branded counterparts are.




[1] Sun F., Liu, H., Hu, D.  (eds). (2019). Cognitive Systems and Signal Processing: 4th International Conference, ICCSIP 2018, Beijing, China, November 29 – December 1, 2018, Revised Selected Papers, Part 1 & Part 2. Singapore: Springer

[2] DeAngelis, S. F. (April 2014). Will 2014 be the Year you Fall in Love with Cognitive Computing? Online: WIRED. Retrieved November 22, 2019 from https://www.wired.com/insights/2014/04/will-2014-year-fall-love-cognitive-computing/

[3] Desouza, K. (October 13, 2016). How can cognitive computing improve public services? Online Brookings Institute’s Techtank Retrieved November 22, 2019 from https://www.brookings.edu/blog/techtank/2016/10/13/how-can-cognitive-computing-improve-public-services/

[4] Gokani, J. (2017). Cognitive Computing: Augmenting Human Intelligence. Online: Stanford University; Stanford Management Science and Engineering; MS&E 238 Blog. Retrieved November 22, 2019 from https://www.datarobot.com/wiki/cognitive-computing/

[5] https://www.datarobot.com/wiki/cognitive-computing/

[6] One example is: Poo, Mu-ming. (November 2, 2016). China Brain Project: Basic Neuroscience, Brain Diseases, and Brain-Inspired Computing. Neuron 92, NeuroView, pp. 591-596.  Online: Elsevier Inc. Retrieved on February 25, 2020 from https://www.cell.com/neuron/pdf/S0896-6273(16)30800-5.pdf  . Another example is: The work engaged at China’s Research Center for Brain-Inspired Intelligence (RCBII), by the teams led by Dr XU, Bo and Dr. ZENG, Yi. Founded in April 2015, at the CAS’ Institute of Automation, the center contains 4 research teams: 1. The Cognitive Brain Modeling Group (aka Brain-Inspired Cognitive Computation); 2. The Brain-Inspired Information Processing Group; 3. The Neuro-robotics Group (aka Brain-Inspired Robotics and Interaction) and 4. Micro-Scale Brain Structure Reconstruction. Find some references here: bii.ia.ac.cn

[7] Harari, Y. N. (2015). Sapiens. A Brief History of Humankind. New York: HarperCollings Publisher

[8] Gillings, M. R., et al. (2016). Information in the Biosphere: Biological and Digital Worlds. Online: University California, Davis (UCD). Retrieved on March 25, 2020 from https://escholarship.org/uc/item/38f4b791

[9] (01 June 2008). Tech Luminaries Address Singularity. Online: Institute of Electrical and Electronics Engineers (IEEE Spectrum). Retrieved on March 25, 2020 from  https://spectrum.ieee.org/static/singularity

[10] Maynard Smith, J. et al. (1995). The Major Transitions in Evolution. Oxford, England: Oxford University Press  AND Calcott, B., et al. (2011). The Major Transitions in Evolution Revisited. The Vienna Series in Theoretical Biology. Boston, MA: The MIT Press.

[11]  National Academy of Engineering and National Research Council. (2002). Technically Speaking: Why All Americans Need to Know More About Technology. Washington, DC: The National Academies Press   Online: NAP Retrieved on March 25, 2020 from https://www.nap.edu/read/10250/chapter/3

[12] Search, for instance, the search string “Technological Literacy” through this online platform: The Education Resources Information Center (ERIC), USA https://eric.ed.gov/?q=Technological+Literacy

[13] Dugger, W. E. Jr. et al (2003). Advancing Excellence in Technology Literacy. In Phi Delta Kappan, v85 n4 p316-20 Dec 2003 Retrieved on March 25, 2020 from https://eric.ed.gov/?q=Technology+LIteracy&ff1=subTechnological+Literacy&ff2=autDugger%2c+William+E.%2c+Jr.&pg=2

[14] Cydis, S. (2015). Authentic Instruction and Technology Literacy. In Journal of Learning Design 2015 Vol. 8 No.1 pp. 68 – 78. Online: Institute of Education Science (IES) & The Education Resources Information Center (ERIC), USA. Retrieved on March 25, 2020 from https://files.eric.ed.gov/fulltext/EJ1060125.pdf


IMAGE CREDITS:

An example artificial neural network with a hidden layer.

en:User:Cburnett / CC BY-SA (http://creativecommons.org/licenses/by-sa/3.0/) Retrieved on March 12, 2020 from https://upload.wikimedia.org/wikipedia/commons/e/e4/Artificial_neural_network.svg


AI, Impact Investment, Ethics & Deeply Human-Centered Innovation

Contents

 

This class of drugs also includes buy sildenafil india and viagra. What is the reason of Kamagra’s popularity? Increasing number of people are turning to the internet for their medication requirements, and this includes over-the-counter medications as well as prescribed ones like brand viagra overnight. Many men as well as couples prefer order generic viagra over other ED drugs. cialis causes fast and effective reaction by helping the men to get prolonged, harder and better erection. Many surveys revealed that after having a range of effective ED pills, still many order cialis canada of the ED sufferers for their sexual disorder.