1
00:00:00,500 --> 00:00:03,410
Hello and welcome back to the course on augmented random search.

2
00:00:03,500 --> 00:00:09,290
In today's world we're going to talk about the perception and we'll find out exactly how it works and

3
00:00:09,290 --> 00:00:10,120
what it is.

4
00:00:10,370 --> 00:00:13,040
So in the various tutorial this is where we left off.

5
00:00:13,040 --> 00:00:18,370
So let's quickly recap the environment such as the magical environment will give us certain inputs.

6
00:00:18,380 --> 00:00:25,910
For example the forces that are applied to the feet of the model then artificial intelligence comes

7
00:00:25,910 --> 00:00:29,220
in at that point it needs to analyze the inputs.

8
00:00:29,270 --> 00:00:35,240
There might be three of them might be 20 maybe 100 inputs need to analyze needs to analyze the inputs

9
00:00:35,540 --> 00:00:42,080
and perform or output certain decisions on what is is it going to do is it going to tell the model to

10
00:00:42,830 --> 00:00:50,330
contracture muscles it is going to tell it to lift this leg and move this right finger or foot this

11
00:00:50,330 --> 00:00:55,100
way and so on and then it will have certain outputs that will control them.

12
00:00:55,130 --> 00:00:56,800
Again that can be sort of output.

13
00:00:56,810 --> 00:01:01,200
For instance would Jako humanoid model has 22 degrees of freedom.

14
00:01:01,550 --> 00:01:09,490
Then we have the outputs the outputs go back into the environment and they control the model or the

15
00:01:09,490 --> 00:01:16,130
agent in its quest in order to walk successfully or run or whatever it's doing.

16
00:01:16,390 --> 00:01:22,010
So that's overall the structure what hides behind this like box of a work.

17
00:01:22,030 --> 00:01:22,840
What is it.

18
00:01:22,840 --> 00:01:26,290
Well this is where a person perception comes in.

19
00:01:26,410 --> 00:01:33,970
So this is our perception that we hear in the air as we use a simple perception which is basically just

20
00:01:34,000 --> 00:01:39,730
taking these inputs and applying certain weights to them and then adding them up.

21
00:01:39,730 --> 00:01:48,720
So as you can see in this case our output will be the sum of x 1 times w 1 plus x 2 times W2 plus x

22
00:01:48,730 --> 00:01:49,890
three times three.

23
00:01:49,930 --> 00:01:51,530
That's what the sum symbol means.

24
00:01:51,670 --> 00:01:57,460
And we add them up at those Wadewitz get the weighted some of the R inputs and that's our output.

25
00:01:57,520 --> 00:02:05,230
So our inputs and all of those forces all the whole stack situation in the environment can be described

26
00:02:05,230 --> 00:02:11,260
if a vector of inputs is just a vector of numbers and our output is for instance this output might be

27
00:02:11,260 --> 00:02:16,930
connected to the the arm of the model to the right arm.

28
00:02:16,930 --> 00:02:23,770
And so this is what we're going to do with the right arm is basically simply a sum of the weighted inputs.

29
00:02:23,800 --> 00:02:28,060
That's all it is and that's what a person perception is.

30
00:02:28,240 --> 00:02:33,310
And now we're going to we're going to remove these labels are here so that it's not distracting for

31
00:02:33,310 --> 00:02:33,880
us.

32
00:02:34,220 --> 00:02:39,640
Now that's a very simple single output for perception.

33
00:02:39,640 --> 00:02:43,190
Now we're going to increase the complexity a bit.

34
00:02:43,210 --> 00:02:49,980
For instance here we've got a perception with two outputs but the concept still remains the same.

35
00:02:50,110 --> 00:02:52,360
In each case we will have certain weights.

36
00:02:52,360 --> 00:02:58,060
Now they have to be longer of an index so that we know which way applies where in this case the three

37
00:02:58,060 --> 00:02:58,960
weights apply here.

38
00:02:58,960 --> 00:03:07,100
So it's the sum of x X-1 times w 1 1 plus x 2 times w 2 1 plus x 3 times w 3 1.

39
00:03:07,120 --> 00:03:13,920
So the first index means which input this weight is related to the second index means which output it's

40
00:03:13,930 --> 00:03:18,640
related to and for the second output we're going to have a different set of weights.

41
00:03:18,850 --> 00:03:24,320
1 2 2 2 and 3 2 and they're all going to come into this output.

42
00:03:24,520 --> 00:03:27,360
And there we go that's our output.

43
00:03:27,370 --> 00:03:33,490
So for instance this could be the left arm this could be the right arm or the left foot right foot so

44
00:03:33,490 --> 00:03:36,960
basically it will be telling based on this number.

45
00:03:37,300 --> 00:03:42,670
This number will be interpreted by the environment and also want our AI.

46
00:03:42,760 --> 00:03:43,820
So this is our AI.

47
00:03:43,840 --> 00:03:50,480
Basically what our AI once the model to do is it wanted to put the left foot forward with a foot back.

48
00:03:50,650 --> 00:03:53,280
And of course this can get more complex.

49
00:03:54,020 --> 00:03:57,360
You can see here that we can have more inputs.

50
00:03:57,370 --> 00:04:03,610
We can have more outputs outputs can number about can be greater the number of inputs it doesn't really

51
00:04:03,610 --> 00:04:04,240
matter.

52
00:04:04,480 --> 00:04:11,230
And so here again we're going to have the sum of weights the weighted sum of all of these inputs of

53
00:04:11,230 --> 00:04:13,720
course the weights are going to be unique.

54
00:04:13,720 --> 00:04:17,470
So as you can see the number of weights is growing quite fast here.

55
00:04:17,980 --> 00:04:20,650
So in this case we have for each one there are four of these.

56
00:04:20,650 --> 00:04:24,280
Verify these for each one we have for each one of these.

57
00:04:24,280 --> 00:04:25,600
We have five weights.

58
00:04:25,660 --> 00:04:27,190
We have 20 weights in total.

59
00:04:27,190 --> 00:04:33,550
That's why instead of writing them out separately we're writing them out as a matrix as matrix as you

60
00:04:33,550 --> 00:04:35,430
can see their road around out here.

61
00:04:35,440 --> 00:04:39,280
So there are a weights and Basen index you can tell.

62
00:04:39,280 --> 00:04:45,910
So for instance W 2 3 is the way that will be applied to in part number two in the calculation of our

63
00:04:46,000 --> 00:04:51,970
number three in the some of that has been calculated so for in order to calculate output number three

64
00:04:52,360 --> 00:04:56,150
we're going to have to use the column of weights.

65
00:04:56,380 --> 00:05:01,910
So w want 3 will be applied to this there'll be two three applied to this though with this double for

66
00:05:01,910 --> 00:05:10,480
three to this all that will be added up and we'll get the output for this for this output which might

67
00:05:10,480 --> 00:05:14,030
mean something specific which is something specific for the world.

68
00:05:14,260 --> 00:05:22,060
So that's basically how a percent perceptual perception works takes the inputs applies a weighted sum

69
00:05:22,690 --> 00:05:24,740
to get each one of the outputs.

70
00:05:25,270 --> 00:05:28,530
And there's nothing more complex than that.

71
00:05:28,570 --> 00:05:33,940
And that's the beauty of this algorithm there is that there is no this is actually called shallow learning

72
00:05:33,940 --> 00:05:37,970
because we don't have any hidden layers is only an input and output layer.

73
00:05:38,200 --> 00:05:44,650
And because of that it is say it's a sham because it's a shallow learning algorithm.

74
00:05:44,650 --> 00:05:53,160
It's much faster and yet it's so simple and still get some great results as we will discuss further.

75
00:05:53,800 --> 00:06:00,040
And I normally don't go into code that's a LUNs area of expertise but I just wanted to give you a heads

76
00:06:00,040 --> 00:06:00,190
up.

77
00:06:00,190 --> 00:06:06,240
So this is the code that you will see in the practical tutorials and I wanted to show you exactly where

78
00:06:06,240 --> 00:06:13,210
the perception is here because you kind of like miss it in the code and then like look through it.

79
00:06:13,350 --> 00:06:15,780
We don't have a specific name.

80
00:06:15,780 --> 00:06:17,430
Overclass for Perceptor.

81
00:06:17,580 --> 00:06:22,520
But it's really interesting to kind of like search for it on your own and look for it.

82
00:06:22,530 --> 00:06:29,100
When I was watching unflustered Rose I really had fun like relating back to the theory of ERs and understanding

83
00:06:29,100 --> 00:06:32,590
where each component is and where the perception is here.

84
00:06:32,670 --> 00:06:34,800
It's hiding in this line.

85
00:06:34,950 --> 00:06:37,620
So this is the perception.

86
00:06:37,680 --> 00:06:40,580
So this what we just created here which we just describe.

87
00:06:40,740 --> 00:06:42,170
That's our perception.

88
00:06:42,450 --> 00:06:44,130
So that's the inputs.

89
00:06:44,610 --> 00:06:51,210
That's the matrix multiplication and that's the way it's all of the matrix and then return means that

90
00:06:51,330 --> 00:06:54,890
it is outputting the results of that output.

91
00:06:54,900 --> 00:06:58,600
And so what is made why is Matrix matrix multiplication here.

92
00:06:58,710 --> 00:07:05,040
Well because if we go back what we can see here is we've got a vector of inputs.

93
00:07:05,160 --> 00:07:10,490
Then we've got a matrix of weights and then we're going to make a way a vector of outputs.

94
00:07:10,530 --> 00:07:17,700
And so in order in mathematical terms or in programming it's simply a matrix multiplication of this

95
00:07:18,560 --> 00:07:26,150
times this matrix of this vector Tam's matrix to get in order to get this vector of outputs.

96
00:07:26,400 --> 00:07:29,760
And so that is the perception right over there.

97
00:07:29,760 --> 00:07:36,870
So even though it's like it looks like a complex construct when we draw it in reality is just one line

98
00:07:36,870 --> 00:07:37,920
of code so don't miss it.

99
00:07:37,920 --> 00:07:41,130
It's like 53 when you're actually building the AI.

100
00:07:41,140 --> 00:07:43,770
We've Adlon in the practical tutorials.

101
00:07:43,920 --> 00:07:44,600
All right.

102
00:07:44,610 --> 00:07:47,130
On that note we're going to wrap up for today.

103
00:07:47,130 --> 00:07:51,180
That's what the perception is very a very simple concept.

104
00:07:51,210 --> 00:07:56,820
But as you'll see is going to yield some amazing results and you'll actually code this yourself if you

105
00:07:56,820 --> 00:08:02,880
do the practical tural you called this yourself and you'll see how much power the single perception

106
00:08:03,210 --> 00:08:06,180
has in the context of Arison is important.

107
00:08:06,180 --> 00:08:11,460
Understand that Eris is not just the perception it's all it's got some additional things additional

108
00:08:11,460 --> 00:08:13,090
features that we'll discuss as well.

109
00:08:13,230 --> 00:08:21,720
But in the context of Eris a perception turns out to be extremely powerful in solving artificial intelligence

110
00:08:21,900 --> 00:08:23,030
challenges.

111
00:08:23,040 --> 00:08:25,210
On that note I hope you enjoy today's tutorial.

112
00:08:25,290 --> 00:08:26,770
I look forward to seeing you next step.

113
00:08:26,850 --> 00:08:28,560
Until then enjoy I.