标签归档 西安品茶网

145 pages of enterprise digital transformation Big Data Lake project construction and operation comprehensive solution WORD

This information source is open to the public, for personal study only, please do not use it commercially.
Part of the information content:

The application, management and display of data lake are integrated, providing standard services, data interfaces and report presentation methods. The data of data lake adopts efficient and reliable storage architecture. The enterprise business data migration plan is formulated, and the core data stored in ERP system, data acquisition system, OA system, video monitoring system and cloud business system are migrated to the data lake as a whole, and the inelastic resources are deployed locally. For the elastic computing function, it is necessary to cooperate with the algorithm data lake. So as to realize the controllability of core data and eliminate security problems and potential unknown risks. Support visual modeling, and support mouse dragging for artificial intelligence algorithm modeling. Including data preprocessing, feature engineering, algorithm model, model evaluation and deployment, etc., it supports many types of algorithm applications in the field of fast-selling business, including logistic regression, K nearest neighbor, random forest, naive Bayes, K-means clustering, linear regression, GBDT binary classification, GBDT regression and other algorithm models, and also supports artificial intelligence training models such as deep learning. The presentation layer displays the operation status and resource usage of various business systems in a multi-dimensional and dynamic way through unified business BI report components. And support the periodic or temporary generation of business situations, decision data display, fault analysis and mining and other business scenarios.

X x data lake architecture diagram

Document center:

It is mainly used to store files in various formats, including video files, video and audio files, PDF files, Office files and other types of files, and provides file-level full-text retrieval, document publishing, file sharing, file extraction and other functions. Provide file rights management, version management, version history recovery and other management functions.

The file content in the file center can exchange fusion data with the log center and data center through ETL process, and participate in data processing, data mining, machine learning, image analysis and so on.

Log center:

Collect all kinds of log data, IOT data and other real-time data, and the data will be processed in real time by the stream processing engine to ensure that the data will be analyzed and processed in the first time, so as to achieve real-time monitoring and real-time alarm.

The processed real-time data can be integrated with the data in the file center and data center to participate in data analysis.

Structured data center:

Real-time (or batch) access to structured data in databases or other media, and efficient processing of all kinds of data with the help of powerful processing capabilities such as Hadoop/Spark.

Effectively combine the data in file center and log center to participate in data analysis and data mining.

Support tens of billions of data Cube to achieve sub-second multi-dimensional query of massive data.

Standard SQL output interface, supporting the needs of continuous upgrading and secondary development.

Schematic diagram of unified interface of data lake interface

Data access principle

1. Give priority to the application-driven construction of high-value digital twin projects;

2. The data entering the lake must be certified by the data management department, and the corresponding data asset standards shall be issued to match the corresponding data responsible person;

3. The principle of data modeling is standardized step by step with original data, clean and integrated data, three normal forms structure and service wide table;

4. The overall platform shall conform to the principle of high availability and parallel expansion, and conform to the data planning of the business for 3-5 years.

Real-time data synchronization supports most real-time database synchronization requirements. Support data synchronization across WAN and receiver clustering. Build a unified, standard, easy to copy and maintain data real-time synchronization platform, and at the same time complete the technical specifications and strategies of data real-time synchronization. Realize data synchronization monitoring system, and build a continuous and reliable real-time monitoring system for data update. Complete the integration mechanism of one-time rapid data import and incremental data import-trickle replication. The Full Dump module is used to realize the encryption of data warehousing, and the HiveSQL interface is provided based on Data Handle, at the same time, the decryption of data warehousing is completed. Control of data access rights through customization of Application Adapter.

L The scheme of keeping the original database for business systems that frequently read and write data, ERP system, data acquisition system, OA system, video monitoring system and cloud business system. Business data should be synchronized to the data lake, and the consistency between local data lake and business system data should be verified periodically during the parallel operation.

L receive real-time incremental data and store the data in the local data lake according to the predetermined architecture. Real-time production data is accessed in real time and reliably transmitted to the company’s database cluster. The data access amount is about 110TB/ day, and the historical data is 40000TB.

Logical architecture diagram of data migration

L Data lake operations are divided into two categories: inelastic and elastic. For inelastic operations, operations are performed in the local data lake. For operations that consume large resources and need elastic calculation, collaborative calculation is adopted with the enterprise cloud, and data is not saved in the enterprise cloud data lake. After the operation calculation is completed, the process and result data are sent back to the local data lake for storage. Interface service supports publish-subscribe mode, cross-data lake and cross-system call, HDFS, Hive, HBase and other systems.

A) interface type

Bulk data encapsulation

A large number of data are extracted according to certain conditions and packaged into data resources. Batch data packaging must be carried out through the system, not manually.

Data request interface encapsulation

The data is encapsulated as an access interface by restful interface, so that the accessor can access the data through remote call.

B) interface security

configuration management

Configure the content of shared data and sharing interface rules, including basic data configuration, sharing service configuration, sharing rights and sharing configuration distribution.

A) basic data configuration

It can configure the basic data used in the data sharing functional domain, including the configuration of the shared data system, the data structure and semantic description of the shared data entity, and the sharing method.

B) shared service configuration

Data service definition, data service directory, data service parameter configuration (such as: target system, sharing mode, data bearing mode, access frequency, access permission period), etc.

C) sharing permission configuration

Configure the permissions of the target system that is allowed to use the shared service, and support the permissions configuration of specific data entities and attributes within the shared service.

D) shared configuration distribution

The content of shared data and sharing interface rules are distributed to all relevant systems.

Data sharing process

Monitoring, exception handling and log management of data sharing processes, and providing query statistics and analysis functions for data related to data sharing.

A) table data sharing

The target system is an application layer analysis system, which directly opens the access rights of tables, and the target system extracts data through ETL.

B) data query

The target system is an application layer analysis system, and the target system directly calls the data query service provided by the data lake to complete the data query.

C) data subscription

The target system is an application layer analysis system, and the target system puts forward data subscription requirements, and the data lake provides data subscription services.

Space is limited, so it can’t be fully displayed. If you like information, you can forward it+comment, and learn more by private message.

The reason of C Ronaldo’s bleak evening scene: mistaking the platform for ability

C Ronaldo has gone far away to the desert, so that the hot sunshine in the desert can cure the injury and pain of the defeated plum ball king!

At the age of c, he has gone to the desert, and his competitive career has been finalized.

Although there is a lot of wealth, it seems to be happy! But its heart is painful!

As we all know, Ronaldo is aloof and arrogant, and I am the only one!

I take the super plum ball king as my responsibility all my life, and pursue honor, glory and data all my life.

C Ronaldo has been a technical career for more than ten years, and his money has been free and his wealth has been satisfactory; It cares more about face, scenery and honor.

During C Ronaldo’s career, during his years in Real Madrid, he won four Golden Globes and four Champions League games, which strengthened his confidence and courage, boasted himself the first, second and third place in the world, was extremely inflated and self-mad.

C Ronaldo’s greatest misfortune is to regard the platform as an ability! It is the real Madrid platform and personal efforts that have made its real Madrid years dazzling!

C Ronaldo left Real Madrid, moved to Juventus and Manchester United, accomplished nothing, and never won the Golden Globe Award, Sir, or the Champions League again! 16 lang in the Champions League every year! Its juventus years are incomplete!

C Ronaldo’s national team was ruined, and fans said that it was lying in the European Cup. Although it was exaggerated, it was not groundless!

C Ronaldo’s World Cup achievement is hands-free, which is a stain on his life! Causing 10 zeros in the World Cup to be bleak. Single-core team leader has the best score in the top 16.

After the dark years after Real Madrid, nothing was achieved after the World Cup.

C Luo Fang understands that it is the Real Madrid platform and Lafayette that have made its brilliant Real Madrid era. I mistook the Real Madrid platform for my ability, woke up like a dream, and wanted to return to Real Madrid and relive the beauty. But the vicissitudes of life, powerless, was rejected by Real Madrid! I have to go to Saudi Arabia to waste the rest of my life!

Life is like this, when you miss the opportunity, it may be a lifetime of regret and regret!

JD.COM Group released Q4 and annual financial report: jingdong cloud laid out industrial AI to promote artificial intelligence landing industry.

JD.COM Group (NASDAQ: JD, HKEx: 9618) released its fourth quarter and annual results in 2022, with annual net income exceeding one trillion yuan. In the fourth quarter, its net service income was 57.8 billion yuan (about 8.4 billion US dollars), a year-on-year increase of 40.3%. Jingdong cloud, as the core brand of technology and services provided by JD.COM, continues to exert its cutting-edge technology and digital infrastructure to boost the real economy to achieve high-quality growth with the ability of digital intelligence supply chain.

Since the comprehensive transformation to technology in early 2017, JD.COM system has invested nearly 100 billion yuan in technology, continuously strengthened its own technical capabilities and industrial digitalization capabilities, and provided technology and services to the outside world with jingdong cloud as its core brand. In the field of AIGC and big model, jingdong cloud’s artificial intelligence application platform of Yanxi is preparing for the industrial version of ChatGPT to accelerate the landing of artificial intelligence technology in the industry. At present, Yanxi virtual anchor has served more than 4,000 brands; JD.COM Exploration Institute upgraded the model of the Weaver Girl, and topped the list of SuperGLUE, an international authoritative task evaluation for complex language understanding.

Jingdong cloud continued to dig deep into cutting-edge technology, and the Weaver Girl model topped the national list again.

In the basic model of common language understanding, JD.COM Exploration Institute upgraded the Vega v2 model with larger scale, stronger performance and better mobility. This model can be widely used in many downstream natural language processing tasks, such as sentiment analysis, semantic matching, grammar correction, intelligent question answering, common sense reasoning and so on. On the SuperGLUE list of international authoritative complex language understanding task evaluation, Vega v2 model surpassed top international institutions such as Google, Microsoft, Facebook and OpenAI, and topped the world with an average score of 91.3.

This is not the first time that the Weaver Girl model has won the championship. Vega v1 ranked first in the top test GLUE in the global natural language processing field with a total average score of 91.3. Vega-MT also won 7 track titles in WMT2022 International Machine Translation Evaluation. The winning of Vega series models proves that JD.COM Exploration Institute’s multilingual natural language processing technology is leading in the field of super deep learning.

Jingdong cloud exports industrial AI capabilities, and digital intelligence technology helps thousands of industries to grow with high quality.

Jingdong cloud’s artificial intelligence application platform is preparing for the industrial version of ChatGPT, and has announced the "125" plan of landing application roadmap. JD.COM will focus on two advantageous industries, namely retail and finance, and carry out technical research around five types of applications: content generation, man-machine dialogue, user intention understanding, information extraction and emotion classification, so as to accelerate the development and landing of artificial intelligence technology in China with the strength of industrial AI and promote the development of real economy.

In the field of digital people, jingdong cloud has launched the virtual anchor of Yanxi. Driven by AI, the virtual anchor of Yanxi has a changeable image, a voice comparable to that of a real person, and a wealth of e-commerce knowledge accumulation. At present, it has interacted smoothly with the audience in hundreds of brand live broadcast rooms, widely serving 3C home appliances, beauty cosmetics, maternal and child, pets, home and other stores, and providing digital anchor services for many well-known brands such as Lenovo, Nut and Yili, bringing millions of GMV growth every day.

Jingdong cloud, as a "more industry-aware" cloud, will not only deepen its technology and maintain its leading position in the industry, but also rely on its own practical experience in the industry to continuously precipitate technology into services and land in major industrial scenes, so as to build an efficient driving force for thousands of industries to build a digital and intelligent supply chain, promote the construction of a modern industrial system and help the industry rebuild its global competitiveness.

What does offside mean in football match? Will your own team be offside in the half court?

Offside is a professional term in football match, which first appeared in the Rules of Football Match promulgated in 1874. It is defined as: when the ball is passed in front of the attack direction, when the football kicks out, if the teammate of the defensive half is closer to the goal than the penultimate defender of the other side, and wants to use this position to interfere with the game or gain benefits, the player will be sentenced to offside.

There are many technical terms in football matches, such as penalty kick, corner kick, sideline kick, offside, etc. Among them, offside is often heard by many people, but its basic meaning is still ambiguous, and sometimes it is impossible to make an accurate judgment under special circumstances. Let’s explain it in detail below.

When we watch a football match, we often see a sideline referee raise a flag just after a player scores the ball into the opponent’s goal. The player who scored the goal has to shake his head helplessly and throw himself back into the attack. Naturally, the goal just scored will not count. Why? Because he was offside, he was offside at the moment when his teammates passed the ball to him.

Which position on the court is offside? In fact, there is no specific position, depending on the position of the players. First of all, offside can only happen in the opponent’s half court. At the moment when our own players pass the ball, we should observe the positions of all the players. When the player who is ready to catch the ball is at the forefront of all the players (excluding the goalkeeper of the opponent), he is offside. This position is not the position of his feet, but the position of his body, and his shoulders and head are leading.

In the past, offside was judged by the sideline referee with the naked eye, which naturally could not be absolutely accurate. But now, with the introduction of high-tech VAR technology, many offsides between millimeters can’t escape from the "eyes". However, although VAR makes the game fairer, it also reduces the excitement of the game to a certain extent and makes many beautiful goals invalid.

There are some special situations about offside on the court. For example, although the attacking player who receives the ball is ahead of all the defensive players (except the goalkeeper), he is still in his own half, so he is not offside at this time.

If a player on the attacking side is offside at the moment when the player passes the ball, but he doesn’t catch the ball or participate in the next attack, then he is not offside.

The last situation: although a player on the attacking side is in an offside position, he just runs with him, participates in the stall, and creates shooting opportunities for his own players, but he never touches the ball from beginning to end. Is he offside? Real fans, please tell me in the comments section.