I am the only one who practices magic: I practice magic in the city

Chapter 160: Go to the basketball team and show off

"Orange...hehehei..." Youzi giggled a few times, and her laughter was full of emotion.

Fang Yu glanced at Youzi. In a sense, this big model could also be considered Youzi’s child.

He wondered only how far this child could grow.
The Orange big model is not just a stack of multiple neural networks underneath; it also embeds a simplified version of the Orange architecture rules, with more than 10 million parameters. The size of the big model alone exceeds G.

A parameter count of 3 million is a terrifying scale in the current year, 3061 of the lunar calendar.

Deep Q-Network, which DeepMind announced just a few months ago, has only 168 million parameters.

The parameter count of DeepFace, the deep learning facial recognition system Feisibu released in the middle of the year, has not been announced, but it is estimated at more than 10 million.

The Google Brain Project, which Google released three years ago, used 10 CPUs for training and claimed to have billion parameters, but the proportion of invalid parameters and negative parameters exceeded %.

It did achieve unsupervised learning on video, but the training results were poor.

But the Orange model is different.

Because Youzi completed the framework of the Orange model inside her own body with the assistance of arcane magic, the invalid and negative parameters among the Orange model's 10 million parameters can basically be kept within %!

It can be said that the newly born Orange model is currently the most powerful AI model in the world!
In a neural network, an AI model's parameters are the equivalent of the synapses in a human brain.

The number of parameters is one of the most important factors affecting the capabilities of artificial intelligence models, and even the decisive factor.

More parameters generally mean that the model has higher representation power and can capture and express more complex patterns and relationships.

In simple terms, the more parameters there are, the more human-like the artificial intelligence becomes.

Moreover, models with more parameters can better fit the training data and reduce the training error.

In simple terms, the more parameters there are, the stronger the AI's ability to understand will be.

In general, the more parameters a model has, the more capable it is.

Although there is only 40G of training data at present, the Orange model has demonstrated a considerable level of intelligence.

This also shows that the deep learning training framework Youzi created is far more efficient than TensorFlow 0.5, the training framework Google released just one month ago.

It is worth noting that an AI training framework and the model framework of a large AI model are two different things.

Take the Orange model: the multi-layer neural networks it uses, along with their hierarchy and the way they connect, make up the Orange model framework.

The training framework is a software platform that provides tools and interfaces for building, training, evaluating, and deploying deep learning models.

To put it simply, if a large model framework that has not yet been trained on data is a brand-new brain, then the training framework is the school, the teacher, and the entire education system.

The hierarchy and structure of the large model framework itself is this new brain's IQ.

The training data is the knowledge that the education system teaches to this new brain using various methods.

Teachers differ in ability, education systems differ, and the knowledge taught differs, so students naturally differ in how efficiently and accurately they master it.

Whether a student does well depends partly on personal IQ and effort, and partly on whether the teaching methods and system are sound and how good the teacher is.

Beyond that, the knowledge itself has to be correct. Teaching students wrong knowledge is of no use in exams or in practice.

In the same way, contaminated or erroneous data cannot train a usable large AI model. A large model trained on contaminated data will have almost no practical value.

The three complement one another, and none can be left out.

Otherwise, how could school district housing be sold at such high prices?
Otherwise, why would tutoring classes be so expensive?

"Youzi, use the Youzi Technology account to upload the training framework's pre-processing techniques to GitHub in batches, one batch every three days in pre-processing order, under the Apache 2.0 license. Then write three papers on the multi-head attention mechanism and post them to arXiv, one a week."

"In addition, look for high-level technical talent from Da Zhou on GitHub, arXiv, and LinkedIn. The requirements are as follows..."

Fang Yu gave Youzi three clear instructions.

It’s time to find a technical team for Youzi Technology. Otherwise, no one would believe that a small company with only three employees could suddenly come up with a training framework and a mature AI model.

As a startup, how can you attract high-level technical talent?
It is simple: you have to be high-level technical talent yourself first.

Genius has a clustering effect.

The things posted on GitHub are bait.

Both Youzi and the Orange model will definitely stay hidden. Fang Yu plans to strip the Orange model down to its most basic framework and hand it to these geniuses to fill in. If the model they fill in is not as efficient as the one Youzi made, he will modify it himself.

In short, he only needs to keep his apparent abilities at the level of a top genius, so that nothing he produces arouses suspicion.

In fact, the core members of a large model team and training architecture team are often not numerous, perhaps only a dozen or even a few people.

Therefore, Fang Yu only needs to recruit three to five algorithm scientists, five to ten engineers, three data processing personnel, and a dozen clerical staff to support this large model team.

The total number of people on the product side can be controlled within 30 people.

Moreover, on the product side, Fang Yu does not plan to hire any foreigners.

It is not that Fang Yu has a strong sense of nationalism; it is mainly a matter of confidentiality.

With people in Da Zhou, he can deal with any incident right away; with people abroad, it would be far more troublesome.

Other companies might worry that top talent is hard to recruit in Da Zhou.

But Youzi Technology does not need to worry about this. Fang Yu is looking for high-level talents, not top talents.

If it weren't for practical concerns, he could build the entire product side by himself with only a finance and operations team behind him, and the efficiency would be even higher.

By then, the only department likely to need a lot of manpower will be the AI alignment department. To put it bluntly, its job is to align the AI's ethics with those of human society.

These are employees the company cannot cut. It needs full-time social science experts and a large number of testers to uncover the AI's ethical problems through all kinds of strange conversations with it, and to prevent those problems from occurring.

Whatever else you economize on, you cannot economize on the reviewers.

But that is a story for later.

Before that, Fang Yu has to find an HR person for Youzi Technology.

Oh, wait. First he has to go to the basketball team and show off.

I have tried my best to write this chapter in a simple, easy-to-understand way and revised it many times, but in the end I kept this part of the content.

Because so much of the later story revolves around artificial intelligence, I want everyone to understand first what an AI model is, what the principle behind it is, and how an AI comes into being.

The author is not showing off or padding the word count. The point is to show, from a professional perspective, how a protagonist who really came up with his own training framework and model framework in real life could release the model without arousing suspicion and maximize his own interests.

Only then can the later plot deliver its payoff.



(End of this chapter)
