Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Time

Title

Presenter

Presenter Organization or Company

Category

Tracks

View file
nameClussys_AI_Networking_LF.pdf

Slides

10:00

Opening

 

View file
nameOPEA _Bytedance_AI4D_Sept_2024.pptx

10:10

Use Coze to simplify your AI development

Gary Qi

Bytedance

Development with AI

10:25

AI4D: Bridging the Gap Between Edge AI and Distributed Cloud Computing

Tina Tsou

ByteDance; InfiniEdge AI

Infrastructure, architecture

InfiniEdge AI

View file
nameAI4D-Bridging-the-Gap(1).pdf

11:05

Optimizing Edge and Cloud Inference Systems for Collaborative Large Language Models

Wenhui Zhang

Bytedance

Co-Ding Studio

Infrastructure, architecture

Akraino; InfiniEdge AI;

View file
nameOptimizing Edge and Cloud Inference Systems for Collaborative Large Language Models (1).pdf

11:45

Network and Computing Infrastructure for the 2030s

Haruhisa Fukano

Fujitsu

Infrastructure, architecture

Akraino;

View file
nameNetwork and Computing Infrastructure for the 2030s.pdf

12:15

Revolutionizing Edge Computing: AI and IoT with InfiniEdge AI

Tom Qin; C.C.

Edgenesis; Allegro

Infrastructure, architecture

InfiniEdge AI

https://docs.google.com/presentation/d/1hv27o1EPtvGGsVzW681ERJI-2q5eIVq-bkQSYzyOt6Y/edit?usp=sharing

13:00

Lunch

 

 

 

 

14:00

AutoSE: An Agentic AI Application Workflow

Zixuan Zhang

Bytedance

Development with AI

Linux Foundation AI & Data

View file
nameAutoSE_ An Agentic AI Application Workflow (2).pdf

14:40

Write Once Run Anywhere, but for GPUs

Michael Yuan

WasmEdge

Infrastructure, architecture

InfiniEdge AI;Linux Foundation AI & Data;

View file
nameWrite once, run anywhere, for GPUs.pdf

15:20

From '+AI' to 'AI+':China Mobile's New Strategic Planning and Practice in AI Era

Lingli Deng

China Mobile

Infrastructure, architecture

Linux Foundation AI & Data; Akraino;

16:00

Using AI to scale global scale massive applications

Sujata Tibrewala

Bytedance

Development with AI

InfiniEdge AI

16:40

AI Networking: From TCP/IP to RDMA and UEC over CXL and UALink

Dr. Fu Li

Clussys Inc.

Infrastructure, architecture

Linux Foundation AI & Data

17:20

An overview of IOWN Global Forum and a hardware-accelerated pipeline for efficient AI analysis over All Photonics Network

Rintaro Harada

NTT

Infrastructure, architecture

View file
nameAI for Developers_NTT.pdf

17:35

Closing

 

 

 

 

Summary

The meeting discussed AI and edge computing, the main contents included:

  • AI applications: in various fields like Auto SE Kim, WASM runtime, and China Mobile's strategic planning.

  • Challenges and opportunities: in model coupling, edge computing, and industry collaboration.

  • Demonstrations: showcasing the capabilities of these technologies.

  • Open source projects and tools: such as ByteDance's Babid Multimedia Framework and BMF libraries integrated with Hugging Face Library.

  • AI networking innovations: and the role of telcos in hosting AI services.

  • Upcoming event: "futures in Taipei".

Chapters

00:00 Advances and Applications of AI in Edge Computing and Distributed Cloud

This section features presentations on AI agent building and capabilities. Gary from codes discusses its various uses and how to build one. Tina Tsou elaborates on bridging the gap between edge AI and distributed cloud computing, covering challenges, tools, security, and future prospects. There's also mention of related workstreams, projects, and opportunities for collaboration.

38:21 Edge and Cloud Collaborative Support and Security for Large Language Models

This section focuses on the deployment and optimization of edge and cloud inferences for large language models. It covers various deployment models, model conversion and optimization, reduction of model size for edge devices, scheduling for fast execution, and a framework for attestation and evidence support. Different techniques and challenges are discussed along with solutions.

01:20:19 The Future of Network and Computing Infrastructure: Insights and Initiatives by Harohisa Fukano from Fujitsu

This section is about Harohisa Fukano from Fujitsu presenting on the future of network and computing infrastructure for the 2030s. He covers LFH projects, computing challenges like power consumption and flexibility, the bottleneck disaggregated computer, joint efforts with the Ion Global Forum, and potential use cases including generative AI plus robotics and green energy data processing, along with upcoming POC phases.

01:50:24 Edge Computing, IoT, and Function Calling in AI Projects Presentation and Demos

This section presents edge computing, IoT, and function calling. Tom introduces Shifu for IoT, detailing its architecture, use cases like with the California Strawberry Commission and a supermarket chain. CC discusses Geo distributed computing, function calling, and related improvements. Demos of Shifu integration and function calling are shown. Questions and discussions follow.

02:30:31 Discussions on Various Topics including Technology, Business, and Education

This section is a complex and lengthy discussion covering various topics including lunch and networking plans, business models, technology applications like TikTok, issues related to code and software development, legal aspects in certain fields, and experiences with different systems and services. The conversation also touches on teams, support, and various challenges and opportunities.

03:32:20 Overview of Auto SE and Comparisons in AI Application Workflow

This section mainly focuses on various discussions related to Auto SE and its features. Li Chuan presents on Auto SE, including its capabilities, comparison with Github Pilot, key highlights like link and live features, switch setup actions, direct support, standing in SMB benchmark leaderboard, and cost savings compared to SMB agent. Also, there are mentions of future discussions and ongoing targeting.

04:14:53 The Need for Tightly Coupled Language Models and Web Assembly in Applications

This section is mainly presented by Michael Yu Yan. He discusses the need for tightly coupling large language models with specific knowledge bases and applications. He gives examples like the chemistry and Rust programming demos, highlighting issues with Python and cross-platform compatibility. He also mentions web assembly as a solution and showcases a cross-GPU demo. Everything presented is open source.

04:56:05 Discussions on AI Applications and Open Source Projects in Various Contexts

This section includes various discussions on AI-related topics. It covers China Mobile's strategic planning in the AI era, such as AI in operation services, creativity, new quality productivity, and strategic decision making. It also features ByteDance's open source projects and their potential in addressing AI challenges. Sujata Tibrewala from ByteDance discussed AI's impact on jobs and the environment.

06:38:08 Advancement and Comparison of AI Networking Technologies and Protocols

This section discusses advancements in AI networking, focusing on scale up and scale out networking. It explores protocols like TCPIP and RDMA, links such as NV Link and CXL, and the need for efficient data transfer in GPU clusters. The presenter also compares different technologies, emphasizing low latency and high performance, and considers future positions regarding networking architectures and protocols.

07:12:51 Overview and Use Cases of Ion Global Forum and Upcoming Event

This section involves Rintaro Harada from NTT presenting about the Ion Global Forum. He introduces its overview, including the aim to create a sustainable society, its founding by NTT, Intel, and Sony. Discusses goals, technical aspects, POC activities, use cases like Smart City, and potential applications in various industries. Also announces an upcoming event in Taipei next month.

Day2 (Sep. 13)

Lark Meeting: https://www.us.larkoffice.com/calendar/share?token=e8edd94e9827b518b68b279ca5120824

...