Introduction to MongoDB and .NET Core Getting Started

2020年10月18日 14点热度 0人点赞 0条评论
内容目录

After setting up the MongoDB cluster yesterday, I began to plan an understanding of MongoDB and introduce use cases. Here, I will share some notes from the learning process to help readers quickly understand MongoDB and code with C#.

Introduction to MongoDB

What is MongoDB

MongoDB is a NoSQL database that primarily features the storage of structured data. It is an open-source database system based on distributed file storage.

Structured Data

In the past, we used databases like MySQL and SQL Server, where data was stored in rows. The structured data in MongoDB distinguishes itself from this tabular format.

Structured data has a hierarchical relationship:

For example:

{
     "name": "MongoDB",
     "type": "database",
     "count": 1,
     "info": {
         "x": 203,
         "y": 102
     }
}

Structured Data

MongoDB and Relational Databases

Since there are no tables, rows, or columns in MongoDB, beginners might find it confusing. Below is a comparison of terms between MongoDB and typical SQL databases.

| SQL Term/Concept | MongoDB Term/Concept | Explanation |
| :---------------- | :------------------- | :--------------------------------------------- |
| database | database | Database |
| table | collection | Database Table/Collection |
| row | document | Data Record Row/Document |
| column | field | Data Field/Domain |
| index | index | Index |
| table joins | | Non-relational databases do not have table relationships |
| primary key | primary key | Primary Key, MongoDB automatically sets the _id field as the primary key |

Source: https://www.runoob.com/mongodb/mongodb-databases-documents-collections.html

Getting Started with MongoDB Commands

After entering the MongoDB shell using mongo, you can execute commands (similar to SQL) for operations.

Note: In MongoDB, there is an automatic _id field, which is automatically set as the primary key by MongoDB.

Show all databases (including system databases):

show dbs

Current database or collection in use:

db

Connect to a specified database:

use {database_name}

Show all collections:

show collections
# or
show tables

View all documents in a collection:

# MyCollection is the name of the collection
db.getCollection("MyCollection").find()
db.getCollection("MyCollection").find().limit(1000).skip(0)

You may not believe it, but I searched on Baidu for a long time and could not find a suitable and friendly article on "how to view all documents in MongoDB," especially with a lot of garbage articles on CSDN. I suggest not to fumble around; download Navicat Premium, which will prompt the commands used at the bottom during operations.

Using Tool to View MongoDB Commands

In addition, MongoDB has many useful tools: https://docs.mongodb.com/tools/

Documents

In MongoDB, a document is equivalent to a record (row) in a relational database.

However, in MongoDB, documents in a collection (Collection-Table) do not need to have the same fields. For example:

Document A:

{
     "name": "MongoDB",
     "type": "database",
     "count": 1,
     "info": {
         "x": 203,
         "y": 102
     }
}

Document B:

{
     "name": "MongoDB",
     "typeName": "database",
     "dataCount": 1,
     "dataInfo": {
         "m": 203,
         "n": 102
     }
}

.NET Core Example

We will start with a basic template.

Create a console application, open Nuget, and install MongoDB.Driver.

            var client = new MongoClient("mongodb://{MongoDB}:27017");
            IMongoDatabase database = client.GetDatabase("Test");

Collections

You can create a collection using CreateCollection() or CreateCollectionAsync(). Unlike traditional databases, you do not need to specify a structure when creating a collection; just specify the name:

await database.CreateCollectionAsync("Test");

Get Collection

The GetCollection() function allows you to retrieve a collection. If the collection does not exist, it will be created automatically.

IMongoCollection<TDocument> GetCollection<TDocument>()

Since the same collection can have documents with different fields and field types, it can be quite challenging to unify several documents if they differ, for example:

Documents in Collection Having Different Fields

(N/A) indicates that the document does not have this field; if one document has 10 fields and another has 8 fields, but none of their fields are the same, merging them would result in 18 fields. Clearly, they should not be grouped together, but should instead be "archived" using strong types.

Create two classes, Test1 and Test2, with the following content:

    public class Test1
    {
        public string Name { get; set; }
    }

    public class Test2
    {
        public string DataType { get; set; }
    }

Get the collection with two document types:

            var collection1 = database.GetCollection<Test1>("Test");
            var collection2 = database.GetCollection<Test2>("Test");

This means you're obtaining the operational capability for documents of this format in the collection.

Insert data into the collection:

            collection1.InsertOne(new Test1 { Name = "Test1" });
            collection2.InsertOne(new Test2 { DataType = "Test2" });
			// await collection.InsertOneAsync(object);

Run and check the results.

file

InsertMany() can insert multiple data items:

            Test1[] datas = new Test1[]
            {
                new Test1 { Name = "Test1" }
            };
            collection1.InsertMany(datas);

Counting Documents

To get the total number of documents in a collection:

collection1.CountDocuments(new BsonDocument())
// await collection1.CountDocumentsAsync(new BsonDocument());

For any document collection object, using CountDocuments(new BsonDocument()) obtains the count of all documents in that collection, rather than just the count of that specific document type. For example:

            var collection1 = database.GetCollection<Test1>("Test");
            collection1.CountDocuments(new BsonDocument())

The result is not the count of Test1 documents but the count of all documents in the collection.

The reason is that CountDocuments() is a filtering function that can use specified conditions to filter the count of the matching documents. This will be discussed further.

Queries

Queries in MongoDB do not operate like expressions in LINQ, based on the IEnumerable or IEnumerable<T> interface, meaning there are no Where or Select query methods provided in the driver.

The Find() function is the query function where you can add extensive expressions to filter documents. Once the data loads into local memory, you can use enriched expressions.

BsonDocument is a type representing the document search conditions. If a BsonDocument object does not add any properties, then by default, all documents meet the condition unless specified otherwise.

We should add a property to both Test1 and Test2 types:

        public ObjectId _id { get; set; }

Otherwise, a formatting error may occur: System.FormatException.

How to Serialize Documents

document is the document object, and JsonSerializer is a static class from System.Text.Json.

Console.WriteLine(JsonSerializer.Serialize(document));

Query the First Record

var document = collection1.Find(new BsonDocument()).FirstOrDefault();

Potential Issues Without Conditions

The following code may cause a runtime error:

            var documents = await collection1.Find(new BsonDocument()).ToListAsync();
            foreach(var item in documents)
            {
                Console.WriteLine(JsonSerializer.Serialize(item));
            }

This is because collection1 is marked as a collection of Test1 documents, but .Find(new BsonDocument()) attempts to query all documents, therefore retrieving Test2 documents.

However, Test2 cannot be converted to Test1, resulting in a runtime error.

View All Documents

var documents = collection1.Find(new BsonDocument()).ToList();
var documents = await collection1.Find(new BsonDocument()).ToListAsync();

As mentioned earlier, if the collection contains other formats of documents, attempting to retrieve all documents may cause an error since Test2 and Test1 are not related.

If the document count is large, use the asynchronous ForEachAsync() query, which operates on a callback principle.

            List<Test1> tests = new List<Test1>();
            Action<Test1> action = item =>
            {
                tests.Add(item);
                Console.WriteLine(JsonSerializer.Serialize(item));
            };

            await collection1.Find(new BsonDocument()).ForEachAsync(action);

End of Queries

After using Find() and subsequent functions for querying, if you want to terminate the query (lazy loading), you can use the ToCursor() function to finish; the program will immediately start querying and return data to memory.

Transform Queries

Using ToEnumerable() allows you to query documents with LINQ.

var list = collection1.Find(new BsonDocument()).ToCursor().ToEnumerable();

Filters

Previously, we used .Find(new BsonDocument()) in our queries; BsonDocument is the filter object that holds the filtering rules. However, we cannot set attributes directly in new BsonDocument(). Instead, we need to utilize the builder FilterDefinitionBuilder object, which can be created using MongoDB.Driver.Builders<TDocument>.Filter.

Assuming you have the following data set (documents):

5f8bdf88e63d14cb5f01dd85	小明	19
5f8bdf88e63d14cb5f01dd86	小红	20
5f8bdf88e63d14cb5f01dd87	小张	16
5f8bdf88e63d14cb5f01dd88	小小	17
    
# -----Code for Inserting Data-----
    public class Test
    {
        public ObjectId _id { get; set; }
        public string Name { get; set; }
        public int Age { get; set; }
    }

            var datas = new Test[]
            {
                new Test{ Name="小明", Age=19},
                new Test{ Name="小红", Age=20},
                new Test{ Name="小张", Age=16},
                new Test{ Name="小小", Age=17}
            };

            collection.InsertMany(datas);

Using the builder:

FilterDefinition<Test> filter = Builders<Test>.Filter

Querying documents where Age is greater than 18:

FilterDefinition<Test> filter = filterBuilder.Where(item => item.Age >= 18);

Retrieve results:

Test[] documents = collection.Find(filter).ToEnumerable<Test>().ToArray();

Filters also include Gt(), In(), Lte(), and other non-LINQ functions, which require looking at the documentation for learning.

Builders<TDocument>

Builders<TDocument> not only generates filtering builders but also several other types of builders:

		// Condition Filters
        public static FilterDefinitionBuilder<TDocument> Filter { get; }

		// Index Filters
        public static IndexKeysDefinitionBuilder<TDocument> IndexKeys { get; }

		// Projector, equivalent to using LINQ's .Select() to query only the needed fields
        public static ProjectionDefinitionBuilder<TDocument> Projection { get; }

		// Sort, creates sorting rules such as sorting by age
        public static SortDefinitionBuilder<TDocument> Sort { get; }

		// Update, alters the values of certain fields, etc.
        public static UpdateDefinitionBuilder<TDocument> Update { get; }

For further details, please refer to https://mongodb.github.io/mongo-csharp-driver/2.10/reference/driver/definitions/#projections.

Name Mapping

Since MongoDB differentiates fields based on case sensitivity, document fields generally use camelCase with a lowercase first letter, while C# field properties start with an uppercase letter. Therefore, it's necessary to map different names.

You can use the BsonElement attribute to set the mapping name.

class Person
{
    [BsonElement("fn")]
    public string FirstName { get; set; }

    [BsonElement("ln")]
    public string LastName { get; set; }
}

This is an introduction to MongoDB, but what are the advantages of using MongoDB? You can refer to this article from Alibaba Cloud: https://developer.aliyun.com/article/64352

The following scenarios are summarized:

  • Storing application logs. The logs are structured, easy to search, and can be exported in other formats for secondary use.

  • Adding fields without changing the table structure, allowing for flexible modifications.

  • Supports JSON format import; similar data structures to JSON; can easily restore object properties and store data in one go; traditional databases would require creating multiple tables and setting primary/foreign key relationships.

  • Clustering. Distributed clusters can handle massive amounts of data and are easily scalable; failover ensures service availability.

  • Addressing distributed file storage needs.

  • Flexible indexing methods.

  • ... ...

痴者工良

高级程序员劝退师

文章评论